(54) AUDIO DATA STRUCTURE, RECORDING MEDIA, AND PROCESSOR



(57) In audio contents that have cells defining an audio
title playback unit, and whose actual playback sequence is
determined by defining the playback sequence of the cells,
cell information specifying the cells is provided with
identification information that identifies the type of each
cell according to the contents of the data included in the
cell. One type of cell data is for obtaining the length of a
silent period of time. The identification information
corresponding to such a cell indicates a silent cell.



[Figure: AUDIO DATA CELLS, comprising AUDIO CELLS (A_C) and SILENT CELLS (SI_C)]

Description

Technical Field 

[0001] This invention relates to an audio data structure
which facilitates the handling of high sound-quality audio
data in processing (recording, reproducing, transmitting,
and constructing) the data and fulfills the high
sound-quality requirement, a recording medium for the audio
data structure, and an apparatus for processing its signal.

Background Art 

[0002] DVD video disks are optical disks on which video
(moving picture) information can be recorded very densely
with high quality and on which various other types of
information, including multiangle pictures, sub-pictures,
multilingual voice, and multichannel audio, can also be
recorded. Such DVD video disks have been developed and
marketed and are finding their way into wide application.
(DVD is an abbreviation for Digital Versatile Disk.)

[0003] The specifications for DVD video disks cover not only
compressed multichannel audio (including AC-3 and MPEG) but
also uncompressed linear PCM (including 48 kHz sampling,
16-bit quantization, 96 kHz sampling, and 24-bit
quantization). The DVD video linear PCM meets
high-frequency-sampling, high-bit-count, high sound quality
specifications, which surpass those for conventional music
CDs (with 44.1 kHz sampling and 16-bit quantization). Linear
PCM with 96-kHz sampling and 20- to 24-bit quantization is
sufficiently qualified for the next generation digital audio
disks (so-called super CDs or super audio disks).
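As a rough illustration of the figures quoted above, the raw bit rate of linear PCM is the product of the sampling frequency, the number of quantization bits, and the number of channels. The sketch below compares the music CD format with 96-kHz, 24-bit linear PCM. The two-channel (stereo) default and the function name `pcm_bit_rate` are assumptions made for illustration and are not taken from the specifications.

```python
def pcm_bit_rate(sample_rate_hz: int, bits_per_sample: int, channels: int = 2) -> int:
    """Raw linear PCM bit rate in bits per second (no packing overhead).

    The default of two channels is an assumption for this example.
    """
    return sample_rate_hz * bits_per_sample * channels


# Conventional music CD: 44.1 kHz sampling, 16-bit quantization.
cd_rate = pcm_bit_rate(44_100, 16)

# Linear PCM with 96 kHz sampling and 24-bit quantization.
hi_res_rate = pcm_bit_rate(96_000, 24)

print(f"CD:            {cd_rate} bit/s")
print(f"96 kHz/24-bit: {hi_res_rate} bit/s")
print(f"ratio:         {hi_res_rate / cd_rate:.2f}")
```

Under these assumptions the higher-quality format needs a little over three times the raw data rate of a music CD, which is why recordable time and channel count trade off against sound quality on a disk of fixed capacity.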
[0004] However, the DVD video specifications have 
been determined by video requirements rather than 
audio requirements. In terms of not only sampling fre- 
quency and the number of quantization bits but also the 
number of recordable channels and recordable time, 
audio-oriented specifications surpassing the DVD video 
sound specifications have been expected. 
[0005] To meet the expectation, DVD audio specifica- 
tions have been studied (it should be noted that the DVD 
audio specifications have not been in the prior art yet). 
The DVD audio specifications have been considered to 
be capable of supporting linear PCM with 48- to 96-kHz 
sampling and 24-bit quantization employed in the DVD 
video specifications up to linear PCM with 192-kHz 
sampling and 24-bit quantization. Moreover, future ver- 
sions of the DVD audio specifications might introduce 
much higher sound quality. 

[0006] The reason why the DVD audio provides upward
compatibility is that it has a part shared with DVD video
that can record a large volume of data covering
high-definition television images. Moreover, the DVD audio
is characterized by having technical, marketable, and
economical advantages in the future when it can be used as a
result of advances in the DVD video.
[0007] For example, when the high-capacity DVD disks to be
put to practical use in future DVD video are used in DVD
audio, there is a possibility that, if the recording time is
constant, the sampling frequency in recording, the number of
quantization bits, and the number of recording channels will
be increased more and more. In addition, the technique for
DVD video recorders using DVD-RAMs (or rewritable DVD-RW or
write-once DVD-R) to be put to practical use in the near
future can be used in the DVD audio recorders that will come
into practical use sooner or later.
[0008] Furthermore, as the popularization of DVD video
expands its market, DVD video and DVD audio share
increasingly more of the recording media (including DVD-ROM
disks, DVD-RAM/DVD-RW disks, and DVD-R disks), unit parts
(including disk drives, optical pickups, and various types
of ICs), and various control programs. This accelerates the
cost reduction of DVD audio products featuring high sound
quality and other advantages. When DVD audio is used widely,
DVD video will in turn enjoy the future technical,
marketable, and economical advantages available as a result
of advances in DVD audio.

[0009] As described above, the development of DVD audio has
been expected, but, as seen from the aforementioned DVD
video, DVD audio with various functions and performances
will possibly be proposed and developed as a result of a
high-density recording disk having been developed.
Specifically, there is a possibility that DVD audio with a
different data structure in terms of sampling frequency, the
number of quantization bits, and the number of channels will
be produced. Moreover, DVD audio with a different data
structure in terms of functions, such as DVD audio with or
without menu images, or DVD audio with or without background
images, will possibly be produced.
[0010] Accordingly, an object of the present invention is to
provide a data structure that enables audio attributes to be
specified track by track. The data structure allows the
reproduction side to deal with DVD audio easily even if
various functions and performances are included in the DVD
audio.
[0011] The reproduction side needs a preparation time for
changing the hardware devices according to the change of the
attributes. The preparation time causes a break in the sound
output. Accordingly, another object of the present invention
is to provide a data structure which positively recognizes a
break in sound and allows the designer or producer to set a
sound break period arbitrarily. The data structure makes it
possible to make silent periods between pieces of music
constant when, for example, a DVD audio disk is played back,
which provides the user with a stable playback condition.

Disclosure of Invention

[0012] To achieve the foregoing objects, identification
information that identifies the type of each cell by the
difference in the contents of the data included in the cell
is provided in cell information specifying the cells, in
audio contents that have cells defining at least an audio
title playback unit and that determine the actual playback
sequence by defining the playback sequence of the cells.
This enables the data structure creator side to deliberately
realize data processing management, timing management, and
setting management on the reproducing apparatus side during
playback according to the contents of the data on the basis
of the identification information.

[0013] One type of the contents of the data in the cells is
for determining the length of a silent period of time. The
identification information corresponding to such a cell is
characterized by indicating a silent cell. Providing a
silent cell for determining the time of the silent period
enables a silent period to be set. Using the silent period,
the reproducing apparatus side can change or set the
attributes. In a case where tracks with a break in sound are
mingled with tracks without a break in sound, effective use
of silent cells at the head of a track without a break in
sound makes it possible to edit the data in such a manner
that a uniform pause period giving no unnatural feeling on
the whole is taken.
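The idea of tagging each cell with its type and using silent cells to even out pauses can be sketched as follows. This is a minimal illustration, not the recorded format: the class names, the `duration_s` field, and the helper `pad_to_uniform_pause` are hypothetical, and only the cell type labels A_C (audio cell) and SI_C (silent cell) come from the document.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class CellType(Enum):
    AUDIO = "A_C"    # audio cell
    SILENT = "SI_C"  # silent cell


@dataclass
class CellInfo:
    """Hypothetical cell information: a type identifier plus a duration."""
    cell_type: CellType
    duration_s: float


def leading_silence(track: List[CellInfo]) -> float:
    """Total duration of the silent cells at the head of a track."""
    total = 0.0
    for cell in track:
        if cell.cell_type is not CellType.SILENT:
            break
        total += cell.duration_s
    return total


def pad_to_uniform_pause(track: List[CellInfo], pause_s: float) -> List[CellInfo]:
    """Return a copy of the track whose head carries at least pause_s of silence.

    A silent cell is prepended only when the existing leading silence is
    shorter than the requested pause.
    """
    missing = pause_s - leading_silence(track)
    if missing <= 0:
        return list(track)
    return [CellInfo(CellType.SILENT, missing)] + list(track)


# A track with no break in sound and a track that already starts with a pause.
gapless = [CellInfo(CellType.AUDIO, 180.0)]
with_gap = [CellInfo(CellType.SILENT, 2.0), CellInfo(CellType.AUDIO, 200.0)]

padded = pad_to_uniform_pause(gapless, 2.0)
unchanged = pad_to_uniform_pause(with_gap, 2.0)
```

Because the player can tell a silent cell from an audio cell by its identifier alone, it could use that interval to switch attributes without decoding the cell contents.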


Brief Description of Drawings 
[0014]

FIG. 1 is a perspective view to help explain the configuration of an optical disk usable as a DVD audio recording medium;
FIG. 2 is a view to help explain the correlation between the data recording area on the optical disk of FIG. 1 and recording tracks of data recorded in the area;
FIG. 3 is a diagram to help explain the hierarchical structure of the information recorded in a DVD audio zone among the various types of information recorded on the optical disk of FIG. 2;
FIG. 4 is a diagram to help explain an example of the data structure of AOTT_AOBS among the pieces of information recorded in the DVD audio zone of FIG. 3;
FIG. 5 is a diagram to help explain the hierarchical structure of the information recorded in a DVD video zone among the various types of information recorded on the optical disk of FIG. 2;
FIG. 6 is a diagram to help explain an example of the data structure of VTSTT_VOBS in the information recorded in the DVD video zone of FIG. 5;
FIG. 7 is a diagram to help explain an example of video information (e.g., VTS_C #2) accessed by both program chain information (ATS_PGCI) in the DVD audio zone of FIG. 3 and program chain information (VTS_PGCI) in the DVD video zone of FIG. 5;
FIG. 8 is a diagram to help explain an example of the data structure that is about the recorded contents of user-accessible DVD audio and is recorded on one side of the optical disk shown in FIG. 1;
FIG. 9 is a diagram to help explain an example of the directory structure of the information (in data files of DVD audio and DVD video) recorded on the optical disk of FIG. 1;
FIG. 10 is a diagram to help explain another example of the directory structure of the information (in data files of DVD audio and DVD video) recorded on the optical disk of FIG. 1;
FIG. 11 is a diagram to help explain a case where the directory on the audio content side accesses a file in the directory on the video content side in the directory structure shown in FIG. 9;
FIG. 12 is a diagram to help explain a case where a file in the directory on the audio content side links with a file in the directory on the video content side in the directory structure shown in FIG. 9;
FIG. 13 is a diagram to help explain an example of how file accessing of FIG. 11 is effected in the volume spaces shown in FIG. 3 or 5;
FIG. 14 is a diagram to help explain another example of how file accessing of FIG. 11 is effected in the volume space shown in FIG. 3 or 5;
FIG. 15 is a diagram to help explain still another example of how file accessing of FIG. 11 is effected in the volume space shown in FIG. 3 or 5;
FIG. 16 is a diagram to help explain the recorded contents of audio manager information (AMGI) in the DVD audio zone shown in FIG. 3;
FIG. 17 shows the recorded contents of the audio manager information management table (AMGI_MAT) included in the audio manager information (AMGI) shown in FIG. 16;
FIG. 18 is a diagram to help explain the contents of the audio title search pointer table (ATT_SRPT) included in the audio manager information (AMGI) shown in FIG. 16;
FIG. 19 is a diagram to help explain the contents of the audio title search pointer (ATT_SRP) included in the audio title search pointer table (ATT_SRPT) shown in FIG. 18;
FIG. 20 is a diagram to help explain the contents of the audio-only title search pointer table (AOTT_SRPT) included in the audio manager information (AMGI) shown in FIG. 16;
FIG. 21 is a diagram to help explain the contents of the audio-only title search pointer (AOTT_SRP) included in the audio-only title search pointer table (AOTT_SRPT) shown in FIG. 20;
FIG. 22 is a table showing the relationship between a group of audio-only titles (AOTT_GR) accessed using the audio-only title search pointer (AOTT_SRP) in the audio manager information (AMGI) shown in FIG. 16 and a group of audio titles (ATT_GR) accessed using the audio title search pointer (ATT_SRP) in the audio manager information (AMGI);
FIG. 23 is a diagram to help explain the recorded contents of an audio title set (ATS) in the DVD audio zone shown in FIG. 3;
FIG. 24 shows the recorded contents of the audio title set information management table (ATSI_MAT) included in the audio title set information (ATSI) shown in FIG. 23;
FIG. 25 is a diagram to help explain the contents of the audio title set program chain information table (ATS_PGCIT) included in the audio title set information (ATSI) shown in FIG. 23;
FIG. 26 is a table showing the contents of the audio title set program information (ATS_PGI) shown in FIG. 25;
FIG. 27 is a table showing the contents of the audio title set cell playback information (ATS_C_PBI) shown in FIG. 25;
FIG. 28 is a diagram to help explain the contents of the audio title set audio still video playback information table (ATS_ASV_PBIT) shown in FIG. 25;
FIG. 29 is a table showing the contents of the audio title set program audio still video playback information search pointer (ATS_PG_ASV_PBI_SRP) shown in FIG. 28;
FIG. 30 is a block diagram of an apparatus for reproducing the recorded information in the DVD audio zone of FIG. 3 or the recorded information in the DVD video zone of FIG. 5 from the optical disk of FIG. 1;
FIG. 31 is a front view of an example of the front panel of the reproducing apparatus of FIG. 30;
FIG. 32 describes the types of audio data cells forming the important part of the present invention;
FIGS. 33A and 33B are diagrams to help explain the types of audio title set program and the data allocation structure;
FIG. 34 is a diagram to help explain an example of audio-only data pack trains in the audio-only title;
FIG. 35 is a diagram to help explain pack trains when audio-only title audio and real-time data are present;
FIG. 36 is a diagram to help explain pack trains when audio-only title audio cells and a silent cell are present;
FIGS. 37A to 37C are diagrams to help explain a pack train where programs in front of and behind a program have the same attributes and are composed of only audio cells, and the change of the presentation time stamp and the change of the playback time caused by the playback sequence;
FIGS. 38A to 38C are diagrams to help explain a pack train where programs in front of and behind a program have different attributes and are composed of audio cells including silent cells, and the change of the presentation time stamp and the change of the playback time caused by the playback sequence;
FIG. 39 is a block diagram of another example of the disk reproducing apparatus according to the present invention; and
FIG. 40 is a block diagram of still another example of the disk reproducing apparatus according to the present invention.

Best Mode of Carrying Out the Invention 

[0015] Hereinafter, referring to the accompanying drawings,
an embodiment of the present invention will be explained.
This invention relates to an audio data structure which
facilitates the handling of high sound quality audio data
and assures the high sound quality in processing (recording,
reproducing, transferring, and constructing) the high sound
quality audio data, a recording medium thereof, a processing
apparatus thereof, and a processing method thereof.



[0016] In the embodiment, explanation will be given as to a
case where the present invention is applied to a system
where the objects of contents (including various video
contents and various audio contents) are shared. In
addition, explanation will be given as to a case where the
invention is applied to an information recording medium with
management data used to share the objects of contents, an
apparatus for reproducing the recorded information from the
medium, a method of recording information including the
management data on the medium, and a method of reproducing
the information from the medium on the basis of the
management data.

[0017] FIG. 1 is a perspective view showing the
configuration of an optical disk 10 that can be used as a
DVD audio recording medium. As shown in FIG. 1, the optical
disk 10 is such that two transparent substrates 14, on each
of which a recording layer 17 is provided, are laminated
together with an adhesion layer 20. Each substrate 14 is
made of 0.6-mm-thick polycarbonate. The adhesion layer 20 is
made of very thin (for example, 40-µm-thick)
ultraviolet-curing resin. The two 0.6-mm-thick substrates 14
are laminated together in such a manner that the recording
layer 17 of each substrate is in contact with one surface of
the adhesion layer 20, which produces a 1.2-mm-thick
large-capacity optical disk 10.

[0018] In the optical disk 10, a central hole 22 is made.
Around the central hole 22 on both sides of the optical disk
10, clamp areas 24 for clamping the optical disk 10 during
rotation are provided. When the optical disk 10 is loaded
into a disk drive unit (not shown), the spindle of a disk
motor is inserted in the central hole 22. While the optical
disk 10 is rotating, the disk is clamped by disk clampers
(not shown) in the clamp areas 24.

[0019] The optical disk 10 has an information area 25 around
the clamp areas 24 in which video data, audio data, and
other pieces of information can be recorded.
[0020] In the information area 25, a lead-out area 26 is
provided at the outer edge and a lead-in area 27 is provided
at the inner edge adjacent to the clamp area 24. A data
recording area 28 is defined between the lead-out area 26
and lead-in area 27.
[0021] In the recording layer (light reflecting layer) 17 of
the information area 25, recording tracks are formed
continuously, for example, in a spiral. The continuous
tracks are divided into physical sectors. Serial numbers are
allocated to the sectors. Using the sectors as recording
units, various types of data are recorded on the optical
disk 10.

[0022] The data recording area 28 is an actual data
recording area including a DVD audio data recording area and
a DVD video data recording area (the DVD video data
recording area might not be used in a pure audio disk).

[0023] In the DVD audio data recording area, audio data is
chiefly written as recording and playback information in the
form of pit trains (or in a physical shape or phase that
optically changes the laser reflected light). Depending on
the situation, still picture data may be recorded in the DVD
audio data recording area. The audio data recorded in the
DVD audio data recording area can include completely silent
data (not a silent portion in music but intentionally silent
data).
[0024] On the other hand, in the DVD video data recording
area, video data (main picture data) for movies, sub-picture
data for subtitles and menus, and audio data for words and
sound effects are recorded as recording and playback
information in the form of pit trains.

[0025] When the optical disk 10 is a single-sided single
layer, double-sided recording DVD-RAM disk (or a rewritable
disk; DVD-RW disk), each recording layer 17 is composed of a
triple layer formed by sandwiching a phase change recording
material (e.g., Ge2Sb2Te5) between two zinc sulfide-silicon
oxide mixtures (ZnS-SiO2).

[0026] When the optical disk 10 is a single-sided single
layer, single-sided recording RAM disk, the recording layer
17 on the reading face 19 side is composed of a triple layer
including the phase change recording material layer. In this
case, the layer 17 located on the opposite side when viewed
from the reading face 19 side need not be an information
recording layer. It may be a simple dummy layer.

[0027] When the optical disk 10 is a single-sided reading
dual-layer RAM/ROM disk, two recording layers 17
are composed of a single phase change recording layer 
(the rear side viewed from the reading face 19; for read- 
ing) and a single translucent metal reflecting layer (the 
front side viewed from the reading face 19; for play- 
back). 

[0028] When the optical disk 10 is a write-once DVD-R,
polycarbonate is used for a substrate. Gold may be used for
a reflecting film (not shown) and an ultraviolet-curing
resin may be used as a protective film (not shown). In this
case, organic pigment is used for the recording layer 17.
Cyanine, squarilium, croconic, triphenylmethane dyes,
xanthene, quinone dyes (e.g., naphthoquinone or
anthraquinone), and metal complex dyes (e.g.,
phthalocyanine, porphyrin, dithiol complex, and the like)
may be used as the organic pigment.
[0029] Data can be written onto such a DVD-R disk using, for
example, a semiconductor laser with an output of about 6 to
12 mW at a wavelength of 650 nm.
[0030] When the optical disk 10 is a single-sided read- 
ing, dual-layer ROM disk, two recording layers 17 are 
composed of a single metal reflecting layer (at the back 
viewed from the reading face 19) and a translucent 
metal reflecting layer (at the front viewed from the read- 
ing face 19). 

[0031] In a read-only DVD-ROM disk (for DVD audio and/or DVD
video), pit trains are formed by a stamper on a substrate 14
in advance. On the surface of the substrate 14 on which the
pit trains have been formed, a reflecting layer of metal or
the like is formed. The reflecting layer is used as the
recording layer 17. In such a DVD-ROM disk, grooves serving
as recording tracks are normally not provided. Instead, the
pit trains formed at the surface of the substrate 14
function as tracks.

[0032] In the various types of optical disk 10, the
playback-only ROM information is recorded as an emboss
signal in the recording layer 17. In contrast, an emboss
signal is not recorded on the substrate 14 having the
read/write (or write-once) recording layer 17. Instead, a
continuous groove is inscribed. The groove is provided with
phase change recording layers and others. In the case of a
read/write DVD-RAM disk, phase change recording layers in
the land portions as well as the groove are used for
information recording.

[0033] When the optical disk 10 is of the single-sided
reading type (with either one or two recording layers), the
substrate 14 on the reverse side viewed from the reading
face 19 is not necessarily transparent to a read/write laser
beam. In this case, a label may be printed on the whole
surface of the reverse-side substrate 14.

[0034] FIG. 2 is a diagram to help explain a correlation
between the data recording area 28 on the optical disk 10 of
FIG. 1 and recording tracks of data items recorded there.
When the optical disk 10 is a DVD-RAM (or DVD-RW), the body
of the optical disk 10 is housed in a cartridge (not shown)
to protect the delicate disk surfaces. When the DVD-RAM disk
together with the cartridge is inserted in the disk drive of
a DVD player explained later, the optical disk 10 is drawn
out of the cartridge and clamped to the turn table of a
spindle motor (not shown). Then, the disk is rotated in such
a manner that it faces an optical head (not shown).
[0035] On the other hand, when the disk 10 is a DVD-R or a
DVD-ROM, the body of the optical disk 10 is not housed in a
cartridge. The naked optical disk 10 is set directly in the
disk tray of the disk drive.
[0036] On the recording layer 17 in the information area 25
of FIG. 1, data tracks are formed continuously in a spiral.
As shown in FIG. 2, the continuous tracks are divided into
logical sectors (the minimum recording unit), each sector
having a specific storage capacity. Data is recorded in
logical sectors. The storage capacity of one logical sector
is set at 2048 bytes (or 2 kilobytes), equal to the data
length of one pack.

[0037] The data recording area 28 is an actual data
recording area, in which management data and sound data have
been recorded for DVD audio and, similarly, management data,
main picture (video) data, sub-picture data, and sound data
have been recorded for DVD video.

[0038] Although not shown, when the optical disk 10 of FIG.
2 is a DVD-RAM disk, the data recording area 28 may be
divided into ring-like (annual-ring-like) recording areas
(recording zones). In this case, the angular speed of the
disk rotation differs from one recording zone to another. In
each zone, however, the linear speed or angular speed can be
made constant. When the optical disk 10 of FIG. 2 is a
DVD-ROM disk, various data items are recorded on all of the
data recording area 28 at a constant linear speed.

[0039] FIG. 3 is a diagram to help explain the hierarchical
structure of the information recorded in the DVD audio zone
among the various pieces of information recorded on the
optical disk 10 of FIG. 2. In FIG. 3, the data recording
area 28 formed on the optical disk 10 has a structure shown
in the figure. A logical format in the structure is
determined so as to comply with, for example, ISO 9660, one
of the standards, and the universal disk format (UDF)
bridge.

[0040] The data recording area 28 between the lead-in area
27 and the lead-out area 26 is allocated as a volume space
28. The volume space 28 can include a space (volume/file
structure area 70) for information on volume and file
structure, a space (DVD audio zone 71 and DVD video zone 72)
for applications complying with the DVD standard, and a
space (other recording areas 73) for applications other than
those complying with the DVD standard.
[0041] The volume space 28 is physically divided into a
large number of sectors. Serial numbers are allocated to the
physical sectors. The logical addresses for the data items
recorded in the volume space 28 mean logical sector numbers
as determined in ISO 9660 and the UDF bridge. Like the
effective data size of a physical sector, the size of a
logical sector is set at 2048 bytes (2 kilobytes). The
logical sector numbers are serial in such a manner that they
correspond to ascending order of physical sector number.
[0042] Unlike a logical sector, a physical sector is 
given redundant information, such as error correction 
information. To be precise, the physical sector size 
therefore does not correspond to the logical sector size. 
[0043] As shown in FIG. 3, the volume space 28 includes a
volume/file structure area 70, a DVD audio zone 71, a DVD
video zone 72, and other recording areas 73. These areas 70
to 73 are separated at the boundaries of logical sectors
shown in FIG. 2. Here, one logical sector is defined as
containing 2048 bytes. One logical block is defined as
containing 2048 bytes. Consequently, one logical sector is
defined in the same manner as one logical block.
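The fixed 2048-byte logical sector makes address arithmetic straightforward. The following sketch shows the conversion between logical sector numbers and byte positions. It assumes that logical sector numbers start at 0 and that offsets are measured from the start of the volume space; both are assumptions for illustration, and the function names are hypothetical.

```python
LOGICAL_SECTOR_SIZE = 2048  # bytes per logical sector, as stated in the text


def sector_to_byte_offset(logical_sector_number: int) -> int:
    """Byte offset of a logical sector, counted from logical sector 0 (assumed)."""
    if logical_sector_number < 0:
        raise ValueError("logical sector number must be non-negative")
    return logical_sector_number * LOGICAL_SECTOR_SIZE


def sectors_needed(num_bytes: int) -> int:
    """Number of whole logical sectors required to hold num_bytes."""
    if num_bytes < 0:
        raise ValueError("size must be non-negative")
    # Ceiling division: a partly filled sector still occupies a whole sector.
    return -(-num_bytes // LOGICAL_SECTOR_SIZE)


offset_of_sector_3 = sector_to_byte_offset(3)
sectors_for_128_kib = sectors_needed(128 * 1024)
```

Because areas are separated at logical sector boundaries, any area or file always occupies a whole number of sectors, which is what `sectors_needed` reflects.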

[0044] The volume/file structure area 70 corresponds to the
management area determined in ISO 9660 and the UDF bridge.
On the basis of the description in the area 70, the contents
of the audio manager (AMG) 711 are stored in the system
memory in the DVD player explained later.

[0045] The DVD audio zone 71 is composed of a simple audio
manager (SAMG) 710, an audio manager (AMG) 711, an audio
still video set (ASVS) 712, and one or more audio title sets
(ATS #m) 713 (the maximum number of audio title sets m is
99).
[0046] The SAMG 710 is a single file containing 128
kilobytes, into which a simple audio play pointer table
(SAPPT) with the same contents has been written eight times.



[0047] The AMG 711 is composed of an audio manager
information (AMGI) file 7110, an audio manager menu video
object set (AMGM_VOBS) file 7111, and an audio manager
information backup (AMGI_BUP) file 7112. The AMGM_VOBS file
7111 is an optional file and may be absent.

[0048] The ASVS 712 is composed of an audio still video set
information (ASVSI) file 7120, an audio still video object
set (ASVOBS) file 7121, and an audio still video set
information backup (ASVSI_BUP) file 7122.
[0049] Each ATS 713 is composed of an audio title set
information (ATSI) file 7130, an audio-only title audio
object set (AOTT_AOBS) file 7131, and an audio title set
information backup (ATSI_BUP) file 7132. Here, the
AOTT_AOBS file 7131 is made up of one to nine files. It is
an optional file and may be absent.
[0050] Referring to FIG. 4, AOTT_AOBS 7131 will be
explained. As explained later, AOTT_AOBS 7131 
defines a set of one or more audio objects (AOB). Each 
AOB defines a set of one or more audio title set cells 
(ATS_C #). A set of one or more cells constitutes an 
audio title set program. A set of one or more programs 
constitutes an audio title set program chain (PGC). 
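The containment relationships just described (an object set holds audio objects, an audio object holds cells, cells form programs, and programs form a program chain) can be modeled as nested containers. The sketch below is a simplified in-memory model only: the class and field names are hypothetical and do not reflect the recorded byte layout.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Cell:
    """One audio title set cell (ATS_C #n), identified here by its number."""
    number: int


@dataclass
class AudioObject:
    """An audio object (AOB): a set of one or more cells."""
    cells: List[Cell]


@dataclass
class Program:
    """An audio title set program: a set of one or more cells."""
    cells: List[Cell]


@dataclass
class ProgramChain:
    """A program chain (PGC): a set of one or more programs."""
    programs: List[Program]

    def playback_sequence(self) -> List[int]:
        """Cell numbers in the order the chain would present them."""
        return [cell.number for program in self.programs for cell in program.cells]


# Three cells stored in one audio object, grouped into two programs.
cells = [Cell(n) for n in (1, 2, 3)]
aob = AudioObject(cells)
pgc = ProgramChain([Program(cells[:2]), Program(cells[2:])])
sequence = pgc.playback_sequence()
```

Keeping the stored objects (AOB) separate from the playback grouping (programs and PGC) mirrors the text's distinction between how cells are recorded and how their playback sequence is defined.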
[0051] In FIG. 3, the structure of AOTT_AOBS 7131 is 
represented directly by a set of ATS_C #. Each PGC is 
expressed by the program chain information in the ATS. 
[0052] When one PGC is compared to an opera, cells 
constituting the PGC correspond to various music 
scenes or singing scenes in the opera. The contents of
the PGC (or the contents of the cells) are determined by 
a software provider that creates the contents recorded 
on an optical disk 10. Specifically, the provider can 
reproduce the cells constituting AOTT_AOBS 7131 as it 
has planned, using cell playback information 
ATS_C_PBI written in program chain information 
ATS_PGCI in the ATS. Explanation of ATS_PGCI and 
ATS_C_PBI will be given later. 
[0053] In the other recording areas 73, usable pieces 
of information in the DVD video zone 72 or other pieces 
of information unrelated to the DVD video zone 72 can 
be written. The recording area 73 is not indispensable 
and may be eliminated, if unnecessary. 
[0054] FIG. 5 is a diagram to help explain the hierarchical
structure of the information recorded in the DVD video zone
72 among the various pieces of information recorded on the
optical disk 10 of FIG. 2. Hereinafter, what has been
explained in FIG. 3 will be omitted and only the part
related to the DVD video zone 72 will be explained.

[0055] On the basis of the description in the volume/file
structure area 70, the contents of the video manager (VMG)
721 are stored in the system memory in the DVD player
explained later.
[0056] The DVD video zone 72 is composed of a video 
manager (VMG) 721 and one or more video title sets 
(VTS #n) 722 (the maximum number "n" of video title 
sets is 99). 

EP 0 986 060 A1

[0057] The VMG 721 is composed of a video manager
information (VMGI) file 7210, a video manager menu video
object set (VMGM_VOBS) file 7211, and a video manager
information backup (VMGI_BUP) file 7212. Here, the
VMGM_VOBS file 7211 is an optional file and may be absent.

[0058] Each VTS 722 is composed of a video title set
information (VTSI) file 7220, a video title set menu video
object set (VTSM_VOBS) file 7221, a video title set title
video object set (VTSTT_VOBS) file 7222, and a video title
set information backup (VTSI_BUP) file 7223. Here, the
VTSM_VOBS file 7221 is an optional file and may be absent.

[0059] Stored in each video title set (VTS) 722 are not 
only the video data (or video packs explained later) 
compressed according to the MPEG standard, the 
audio data (audio packs explained later) compressed or 
uncompressed according to a specific standard, and 
run-length-compressed sub-picture data (sub-picture 
packs explained later, including bit-map data whose one 
pixel is defined using plural bits), but also information 
used to reproduce these data items (navigation packs 
explained later, including presentation control informa- 
tion and data search information). 
[0060] Referring to FIG. 6, VTSTT_VOBS 7222 will be
explained. As explained later, VTSTT_VOBS 7222 
defines a set of one or more video objects (VOB). Each 
VOB defines a set of one or more video title set cells 
(VTS_C #n). VTS_C #n is composed of one or more 
video object units (VOBU). A VOBU may include navi- 
gation packs, audio packs, and sub-picture packs. A set 
of one or more video title set cells (VTS_C #n) consti- 
tutes a video title set (VTS) program. A set of one or 
more programs constitutes a video title set (VTS) pro- 
gram chain (PGC). 

[0061] FIG. 5 shows the relationship between a pro- 
gram chain (PGC) and video title set cells (VTS_C #n). 
[0062] When one PGC is compared to a drama, cells 
constituting the PGC can be considered to correspond 
to various scenes in the drama. The contents of the 
PGC (or the contents of the cells) are determined by a 
software provider that creates the contents recorded on 
an optical disk 10. Specifically, as with the ATS_PGCI 
explained in FIG. 3, the provider can reproduce the cells 
constituting VTSTT_VOBS 7222 as it has planned, 
using the cell playback information (not shown) written 
in program chain information (VTS_PGCI) in the VTS. 
[0063] FIG. 7 is a diagram to help explain a case 
where specific pieces of video information (VTS_C #2, 
VTS_C #3, VTS_C #5) are accessed (in different meth- 
ods) by both the program chain information (ATS_PGCI) 
in the DVD audio zone 71 of FIG. 3 and the program 
chain information (VTS_PGCI) in the DVD video zone
72 of FIG. 5. In other words, FIG. 7 shows a case where
the same video objects (VOB) are referred to in different 
methods by the audio reproducing unit and video repro- 
ducing unit. 

[0064] Specifically, when video playback is carried out 
from the video title set (VTS) side, cells VTS_C #1 to
VTS_C #6 in the VOB are reproduced in sequence on
the basis of the cell playback information (not shown) in 
the VTS_PGCI. 

[0065] On the other hand, when video playback (or still
playback) is carried out from the audio title set (ATS)
side, cells VTS_C #2, VTS_C #3, and VTS_C #5 in the
VOB are selectively reproduced on the basis of the cell
playback information (ATS_C_PBI) in the ATS_PGCI.
[0066] In this case, because the ATS and the VTS need
not each hold a separate copy of the same cell data items
(VTS_C #2, VTS_C #3, and VTS_C #5) on the same
optical disk 10, the limited storage capacity of the
optical disk 10 can be used effectively.
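The sharing of cells described in paragraphs [0063] to [0066] can be illustrated with a short Python sketch. The lists `vts_pgci` and `ats_pgci` are hypothetical stand-ins for the cell playback information; the cell numbers are those given in the text.

```python
# Cells of one video object (VOB), keyed by cell number and stored only once.
vob_cells = {n: f"VTS_C #{n}" for n in range(1, 7)}

# Hypothetical playback lists standing in for the cell playback information.
vts_pgci = [1, 2, 3, 4, 5, 6]   # video side: all cells in sequence
ats_pgci = [2, 3, 5]            # audio side: selected cells only


def play(pgci, cells):
    """Return the cells in the order given by the program chain information."""
    return [cells[n] for n in pgci]
```

Both playback lists index into the same `vob_cells` table, so no cell is stored twice, which is the saving in disk capacity that the text points out.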
[0067] FIG. 4 shows an example of the data structure
of the recorded contents (AOTT_AOBS) in the DVD
audio zone 71 of FIG. 3. The AOTT_AOBS 7131
explained in FIG. 3 defines a set of one or more audio
objects (AOTT_AOB #) as shown in FIG. 4. Each
AOTT_AOB defines a set of one or more audio title set
cells (ATS_C #). A set of one or more cells (ATS_C #)
constitutes a program. A set of one or more programs
constitutes a program chain (PGC). This PGC constitutes
a logical unit indicating the whole of or part of an
audio title.

[0068] In the example of FIG. 4, each audio title set
cell (ATS_C #) is composed of a set of 2048-byte audio
packs (A_PCK). These packs are the smallest units in
performing a data transfer process. The smallest unit in
logical processing is a cell. Logical processing is done
in units of cells.

[0069] FIG. 6 shows an example of the data structure 
of the recorded contents (VTSTT_VOBS) in the DVD 
video zone 72 of FIG. 5. 

[0070] As shown in FIG. 6, the VTSTT_VOBS 7222
explained in FIG. 5 defines a set of one or more video
objects (VOB #). Each VOB defines a set of one or more
video title set cells (VTS_C #). Each VTS_C defines a
set of one or more video object units (VOBU). A set of
one or more video title set cells constitutes a program. A
set of one or more programs constitutes a program
chain (PGC). The PGC constitutes a logical unit indicating
the whole of or part of a video title or visual menu.
[0071] As shown in FIG. 6, each VOBU is a collection
(a pack train) of a navigation pack, video packs
(MPEG-compressed moving picture data), sub-picture packs
(run-length-compressed bit map data), and audio packs
(uncompressed linear PCM audio data or compressed
multichannel audio data), with the navigation pack at the
head. Specifically, the video object unit (VOBU) is
defined as a collection of all the packs starting from a
navigation pack to the one just before the next navigation
pack. The navigation pack is incorporated in each
VOBU to realize angle change (seamless angle change
playback or nonseamless angle change playback).
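The definition of a VOBU as all packs from one navigation pack up to the pack just before the next navigation pack can be sketched in Python as follows. The string labels for pack types ("NV" for a navigation pack and "V", "SP", "A" for video, sub-picture, and audio packs) are assumptions made for illustration only.

```python
def split_into_vobus(packs):
    """Group a pack train into VOBUs; each VOBU starts at a navigation pack."""
    vobus = []
    for pack in packs:
        if pack == "NV":
            vobus.append([pack])          # a navigation pack opens a new VOBU
        elif vobus:
            vobus[-1].append(pack)        # other packs join the current VOBU
        else:
            raise ValueError("pack train must start with a navigation pack")
    return vobus
```

For example, the pack train `["NV", "V", "A", "NV", "A"]` is split into two VOBUs, the second of which holds only audio packs behind its navigation pack.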

[0072] Those packs are used as the smallest units in
transferring data as in FIG. 4. The smallest unit in logical
processing is a cell. Logical processing is done in
units of cells.
[0073] The playback time of the VOBU corresponds to
the playback time of the video data made up of one or 
more video groups (Groups of Pictures, abbreviated as 
GOPs) contained in the VOBU. The playback time is set 
in the range from 0.4 second to 1.2 seconds. In the 
MPEG standard, the playback time of one GOP is nor- 
mally about 0.5 second. One GOP contains screen data 
compressed so that about 15 pictures may be repro- 
duced in about 0.5 second. 

[0074] When the VOBU includes video data, GOPs 
(complying with the MPEG standard) composed of 
video packs, sub-picture packs, and audio packs are 
arranged to produce a video data stream. The VOBU is 
determined on the basis of the playback time of the 
GOPs, regardless of the number of GOPs. At the head 
of the VOBU, a navigation pack is always placed.
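The timing rule of paragraphs [0073] and [0074] can be checked with a minimal Python sketch. The function names are hypothetical; the limits of 0.4 and 1.2 seconds and the typical GOP time of about 0.5 second are taken from the text.

```python
VOBU_MIN_S = 0.4   # shortest VOBU playback time given in the text
VOBU_MAX_S = 1.2   # longest VOBU playback time given in the text


def vobu_playback_time(gop_durations):
    """Return the VOBU playback time as the sum of its GOP playback times."""
    return sum(gop_durations)


def is_valid_vobu(gop_durations):
    """Check the playback time, not the GOP count, against the allowed range."""
    return VOBU_MIN_S <= vobu_playback_time(gop_durations) <= VOBU_MAX_S
```

With GOPs of about 0.5 second each, one or two GOPs fit in a VOBU, while three would exceed the upper limit. The figure of about 15 pictures in about 0.5 second corresponds to roughly 30 pictures per second.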
[0075] In DVD video playback, even when the play- 
back data contains only audio and/or sub-picture data, it 
is constructed using a VOBU as one unit. For example, 
when a VOBU is made up of only audio packs, with a 
navigation pack at the head, the audio packs to be 
reproduced within the playback time (0.4 second to 1.2
seconds) of the VOBU to which the audio data belongs 
are stored in the VOBU, as in the video data VOBU. 
[0076] As shown in FIG. 6, the VTSTT_VOBS is
defined as a set of one or more VOBs. The VOBs in the 
VOBS are used for the same application. A menu VOBS 
is usually composed of one VOB, in which menu screen 
display data items are stored. In contrast, a video title 
set VOBS is usually composed of more than one VOB. 
[0077] When a concert video for a certain rock band
is taken as an example, VOBs constituting a video
object set (VTSTT_VOBS) for title sets can be considered
to correspond to the video data for the performance
of the band. In this case, by specifying particular
VOBs, for example, the third piece on the band's concert
program can be reproduced.
[0078] In the VOBs constituting a video object set 
(VTSM_VOBS) for menus, the menu data for all the 
pieces of the band's concert program is stored. According
to the menu on the screen, a specific piece of music,
for example, an encore, can be reproduced. 
[0079] In an ordinary video program, one VOBS can 
be composed of one VOB. In this case, one video 
stream is completed with a single VOB. 
[0080] On the other hand, for example, in the case of 
a collection of animations with multiple stories or omni- 
bus movies, plural video streams [plural program chains 
(PGCs)] can be provided for each story in one VOBS. In 
this case, each video stream is stored in the corre- 
sponding VOB. At that time, the audio stream and sub- 
picture stream related to each video stream are also 
completed in each VOB. 

[0081] Each video object (VOB) is assigned an identi- 
fication number (#i; i = 0 to i). By the identification 
number, the VOB can be identified. A VOB is composed 
of one or more cells. An ordinary video stream is made 
up of plural cells. A video stream for menus may be
composed of one cell. Like the VOB, each cell is
assigned an identification number (#j; j = 1 to j). 
[0082] FIG. 8 shows the recorded contents in the
user-accessible DVD audio zone 71 to help explain an
example of the data structure recorded on one side (of
one or two layers) of the optical disk 10 shown in FIG. 1.
[0083] In DVD audio, a hierarchical structure composed
of albums, groups, tracks, and indexes is prepared
as a management structure for recorded contents
viewed from the software producer side.

[0084] An album corresponds to one side of a DVD
audio disk. For example, "The first volume of works by
Beethoven" can be allocated to the album. In this case,
the album may be composed of group #1 of Symphony
No. 1 to group #9 of Symphony No. 9.

[0085] Each group (e.g., group #1) is composed of the
first to fourth movements of the corresponding symphony
(Symphony No. 1). Each track is composed of
indexes #1 to #i, which are obtained by dividing a track
(e.g., track #1) into i pieces.

[0086] When the user plays back a DVD audio disk
with such a hierarchical structure as is shown in FIG. 8,
the user sets the optical disk 10 in the DVD audio player,
operates the remote controller (not shown), and selects
group #1 and track #1.

[0087] After the selection, when the user presses the
playback button on the remote controller, the DVD audio
player starts to reproduce Beethoven's Symphony No. 1,
starting at the first movement. When the user specifies
a specific index from the remote controller, the specified
index portion is searched for and playback is started at
that portion. The first index part of the first track in the
first group in the album can be reproduced by default, that is,
even when the user specifies nothing.
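The album, group, track, and index hierarchy and the default selection described above can be sketched in Python as follows. The nested dictionary and the `select` function are illustrative assumptions about how a player might hold this structure; the text specifies only the hierarchy and the default of the first index of the first track in the first group.

```python
# One album: group number -> track number -> list of indexes.
album = {
    1: {                                   # group #1 (e.g. Symphony No. 1)
        1: ["index #1", "index #2"],       # track #1 divided into indexes
        2: ["index #1"],                   # track #2
    },
}


def select(album, group=1, track=1, index=1):
    """Return the index at which playback starts; each level defaults to the first."""
    return album[group][track][index - 1]
```

Calling `select(album)` without further arguments models the case in which the user specifies nothing.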

[0088] In playing back a DVD disk, the user can rec-
ognize the title (such as, the title of a specific movie), 
whereas in playing back a DVD audio disk, the user can- 
not see the title. What the user can see are only the 
album, groups, tracks, and indexes shown in FIG. 8. 

[0089] FIG. 9 shows the directory structure of the
information (DVD audio and DVD video data files) 
recorded on the optical disk 10 shown in FIG. 1. The 
structure is an example of a file directory structure 
defined in the DVD file standard. 

[0090] As in a hierarchical file structure used by a
general-purpose computer operating system, a subdirectory
of video title set (VTS), a subdirectory of audio title
set (ATS), and a user-defined directory are connected
to a root directory.

[0091] Specifically, in the subdirectory of video title set
(VTS), various video files (including VMGI, VMGM,
VTSI, VTSM, and VTS files) as shown in FIG. 5 are so
arranged that the individual files can be managed in
order.

[0092] Moreover, in the subdirectory of audio title set
(ATS), various audio files (including AMGI, ATSI, and
ATS files) as shown in FIG. 3 are so arranged that the
individual files can be managed in order.
[0093] The user can access a specific file (for example,
a specific VTS or a specific ATS) by specifying a
path from the root directory to the file. 
[0094] When a DVD video player produced according 
to the DVD video standard plays back a DVD video disk
produced according to the DVD video standard, it first 
reads the management information (VMG) in the video 
title set (VTS) directory under the root directory and 
reproduces the video contents on the basis of the infor- 
mation. However, what can be reproduced according to 
the VMG is limited to the video contents (VTS) recorded 
in the VTS directory. 

[0095] On the other hand, when a DVD audio player 
(or a DVD video-DVD audio compatible player) produced
according to the DVD audio standard plays back
a DVD audio disk produced according to the DVD audio 
standard, it first reads the management information 
(AMG) in the audio title set (ATS) directory under the 
root directory and reproduces the audio contents on the 
basis of the information. 

[0096] In this case, what can be reproduced according 
to the AMG is not limited to the audio contents (ATS) 
recorded in the ATS directory. The video contents (VTS) 
in the VTS directory can also be reproduced (the repro- 
ducing method will be explained later). 
[0097] FIG. 10 shows another example of the direc- 
tory structure of the information (DVD audio and DVD 
video data files) recorded on the optical disk 10 shown 
in FIG. 1. In the example of FIG. 9, the VTS directory 
and the ATS directory are placed in the same level of 
hierarchy under the root directory. On the other hand, in 
the example of FIG. 10, the ATS directory (child directory)
is placed in a level of hierarchy under the root
directory (parent directory). The VTS directory (grand- 
child directory) is placed in a level of hierarchy under the 
ATS directory. 

[0098] FIG. 11 is a diagram to help explain the way the
directory on the audio content side accesses a file in the 
directory on the video content side in the directory 
structure shown in FIG. 9. 

[0099] Specifically, in the hierarchical management 
structure for managing the data files recorded on the 
optical disk 10, the video title set directory (a child directory)
and the audio title set directory (a child directory)
are placed under the root directory (a parent directory). 
[0100] The video title set directory (VTS directory) is
a directory for dealing with the video content files
recorded on the optical disk 10 and includes a video
manager (VMG) file and one or more video title set (VTS)
files (video content logical units) (see FIG. 5).
[0101] The audio title set directory (ATS directory) is a
directory for dealing with the audio content files
recorded on the optical disk 10 and includes an audio
manager (AMG) file and one or more audio title set (ATS)
files (audio content logical units) as well as the aforementioned
SAMG and ASVS (not shown in FIG. 11)
(see FIG. 3).

[0102] The VMG in the VTS directory manages only
the VTS and is designed to access only the VTS in the
VTS directory.

[0103] On the other hand, the AMG in the ATS direc- 
tory manages mainly the ATS and is designed to access 
not only the ATS in the ATS directory but also the VTS 
in the VTS directory. 

[0104] The AMG includes audio manager information
(AMGI). The AMGI includes an audio title search pointer 
table (ATT_SRPT). The ATT_SRPT includes an audio- 
only title (AOTT) search pointer (ATT_SRP) and an 
audio video (AVTT) search pointer (ATT_SRP). The 
contents of these will be explained in detail later. 
[0105] Specifically, the AMG in the ATS directory can
access the audio title sets (ATS #1, ATS #2, ...) in the
ATS directory using the AOTT search pointer in the
ATT_SRPT. It can also access the video title sets (VTS
#1, VTS #2, ...) in the VTS directory using the AVTT
search pointer in the ATT_SRPT. This enables a certain
object (such as VTS #1) to be shared by both the video
contents and the audio contents. This is one of the
important characteristics of "an object sharing system"
according to the present invention.
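The way the AMG reaches either directory through its search pointers can be sketched in Python as follows. The class `SearchPointer`, its field names, and the returned strings are hypothetical and serve only to show the dispatch; the text states only that an AOTT search pointer leads to an ATS and an AVTT search pointer leads to a VTS.

```python
from dataclasses import dataclass


@dataclass
class SearchPointer:
    kind: str        # "AOTT" (audio-only) or "AVTT" (video-added)
    title_set: int   # number of the ATS or VTS the title belongs to


def resolve(pointer: SearchPointer) -> str:
    """Return the directory and title set that a search pointer leads to."""
    if pointer.kind == "AOTT":
        return f"ATS directory / ATS #{pointer.title_set}"
    if pointer.kind == "AVTT":
        return f"VTS directory / VTS #{pointer.title_set}"
    raise ValueError(f"unknown title kind: {pointer.kind}")
```

A VMG has no counterpart to the AVTT branch, which is why, in this model, only the audio side can share an object such as VTS #1.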
[0106] FIG. 12 is a diagram to help explain a case 
where a file in the directory on the audio content side 
links with a file in the directory on the video content side. 
FIG. 12 can be considered to be a modification of FIG. 
11. 

[0107] Specifically, in the example of FIG. 11, the
audio manager (AMG) is designed to be able to access 
both an audio title set (ATS) and a video title set (VTS). 
This enables a VTS to be shared by the video contents 
and audio contents. 

[0108] On the other hand, in the example of FIG. 12,
information (e.g., a pointer indicating an address for a 
specific part of VTS #1) to link with a video title set 
(here, VTS #1) is written in an audio title set (here, ATS 
#1). This enables, for example, the audio data in VTS #1 
to be shared by the video contents and audio contents. 
[0109] FIG. 13 shows a data structure to help explain 
an example of how file access in FIG. 11 is carried out
in the volume space 28 shown in FIG. 3 or 5. The data
structure of FIG. 13 corresponds to the directory structure
of FIG. 11.

[0110] In FIG. 13, the shaded portions indicate examples
of the contents shared by the video contents (or
video volume) and the audio contents (or audio vol- 
ume). 

[0111] The basic idea of the data structure of FIG. 13 
is to record a recording area (VMG + VTS) for video 
contents and a recording area (AMG + ATS) for audio 
contents in the volume space 28 independently and 
enable video contents shared by both video and audio 
uses to be managed by the AMG. 
[0112] Specifically, in FIG. 13, the video title set (VTS
#1) managed by the VMG can access part (cells) of the
video object set (VOBS #1) and the audio title set (ATS
#1) managed by the AMG can access the other part
(cells) of VOBS #1. In this example, part of the cells
constituting the video object set (VOBS #1) of VTS #1 are
shared by the video contents and audio contents.
[0113] In the data structure of FIG. 13, the DVD audio
zone 71 is placed in locations with lower addresses
(closer to the lead-in area 27 in FIG. 3). The DVD video
zone 72 is placed in locations with higher addresses
(closer to the lead-out area 26 in FIG. 3). In this case,
the AMG always uses increasing addresses when it
accesses either the ATS or the VTS and need not deal
with decreasing addresses. This facilitates the construction
of the reproducing system.
[0114] FIG. 14 shows a data structure to help explain
another example of how file access in FIG. 11 is carried
out in the volume space 28 shown in FIG. 3 or 5.
Namely, FIG. 14 can be considered to be a modification 
of FIG. 13. 

[0115] In FIG. 13, because the DVD audio zone 71 is 
placed in locations with lower addresses and the DVD 
video zone 72 is placed in locations with higher 
addresses, the AMG need not handle decreasing 
addresses. 

[0116] On the other hand, in FIG. 14, the DVD video 
zone 72 is placed in locations with lower addresses 
(closer to the lead-in area 27 in FIG. 3). The DVD audio 
zone 71 is placed in locations with higher addresses 
(closer to the lead-out area 26 in FIG. 3). In this case, 
the AMG uses increasing addresses when it accesses 
the ATS and decreasing addresses when it accesses 
the VTS. In this case, addressing in accessing a desired 
object (a cell in the ATS or VTS) is troublesome. There- 
fore, it is difficult to apply the technique to commercial- 
use DVD audio players for which product costs are a 
problem. 

[0117] However, when a personal computer with a 
DVD drive is turned into a DVD audio player, the cost 
problem can be avoided even when the data structure of 
FIG. 14 has been employed. Specifically, the operating
system (or the control software) of a personal computer
that has analyzed the data structure of FIG. 14 can
make the physical arrangement of FIG. 14 appear as
the arrangement of FIG. 13 by remapping the addresses
in its memory. By doing this, the MPU or CPU of the
personal computer can access either the ATS or the VTS
from the AMG by
specifying increasing addresses.
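The remapping idea can be sketched in Python as follows. This is only an illustration under stated assumptions: each zone is described by a hypothetical (physical start, length) pair in logical blocks, and the function `build_remap` is not part of any DVD player software named in the text.

```python
def build_remap(audio_zone, video_zone):
    """Map logical block numbers (audio zone first) to physical block numbers.

    Each zone is a (physical_start, length) pair counted in logical blocks.
    """
    a_start, a_len = audio_zone
    v_start, v_len = video_zone

    def to_physical(logical):
        if 0 <= logical < a_len:
            return a_start + logical            # audio zone appears first
        if a_len <= logical < a_len + v_len:
            return v_start + (logical - a_len)  # video zone appears after it
        raise IndexError("logical block outside both zones")

    return to_physical
```

With a physical layout like FIG. 14 (video zone at lower addresses), the remapped view presents the audio zone first, so software working in the remapped view only ever moves toward higher addresses when going from the AMG to an ATS or a VTS.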
[0118] FIG. 15 shows a data structure to help explain
still another example of how file access in FIG. 11 is carried
out in the volume space 28 shown in FIG. 3 or 5.
Namely, FIG. 15 can be considered to be a modification 
of FIG. 13. 

[0119] In FIG. 13, because the DVD audio zone 71 is 
placed in locations with lower addresses and the DVD 
video zone 72 is placed in locations with higher 
addresses, the AMG need not handle decreasing 
addresses. 

[0120] In contrast, in the data structure of FIG. 15, the
AMG in the DVD audio zone 71 is placed in locations
with lower addresses (closer to the lead-in area 27 in
FIG. 3). The VMG in the DVD video zone 72 is placed in
locations with higher addresses (closer to the lead-out
area 26 in FIG. 3). In this case, the AMG always uses
increasing addresses when it accesses
either the ATS or the VTS and need not deal with
decreasing addresses. Therefore, as in FIG. 13, it is
easy to construct a reproducing system.
[0121] Because the data structure of FIG. 15 is a
nested structure where VTS #1 is placed in ATS #1, the
VMG of FIG. 5 cannot recognize that the VTS in the ATS
exists in the DVD video zone 72. In this case, the VMG
can treat the VTS in the ATS as existing in the other
recording areas 73.

[0122] The data structure of FIG. 15 can be used
when the other recording areas 73 are used in a case
where the AMG is allowed to access not only the ATS
but also the VTS.

[0123] Three examples of the data structure that enables
the AMG to access not only the ATS but also the VTS
have been shown in FIGS. 13 to 15. The most favorable one
is the data structure of FIG. 13. The reason is that a
desired common object can be accessed by just specifying
increasing addresses without remapping the
addresses.

[0124] FIG. 16 is a diagram to help explain the
recorded contents of the audio manager information
(AMGI) in the DVD audio zone 71 shown in FIG. 3.
[0125] The contents the DVD audio zone 71 deals with
include two types of titles: an audio-only title (AOTT)
and a video-added audio title [or audio-video title
(AVTT)].

[0126] The AOTT is a title in an audio disk (A disk)
without a video section and is defined in the ATS
recorded under the audio title set directory. On the other
hand, the AVTT is a title in an audio-video disk (AV disk)
with a video section and is defined in the VTS recorded
under the video title set directory. The AOTT and AVTT
are generally called ATT (audio title).
[0127] The DVD audio zone 71 in which the ATT data
is recorded is composed of SAMG 710, AMG 711,
ASVS 712, and one or more (up to 99) audio title sets
(ATS #1 to ATS #m) 713.

[0128] The AMG 711 is composed of an audio manager
information (AMGI) file 7110, an audio manager
menu video object set (AMGM_VOBS) file (optional file)
7111, and an audio manager information backup
(AMGI_BUP) file 7112.

[0129] The AMGI file 7110 includes an audio manager
information management table (AMGI_MAT), an audio
title search pointer table (ATT_SRPT), an audio-only
title search pointer table (AOTT_SRPT), an audio manager
menu program chain information unit table
(AMGM_PGCI_UT), and an audio text data manager
(ATXTDT_MG).

[0130] Specifically, the AMG 711 has two pieces of
search information (ATT_SRPT) and (AOTT_SRPT).
Here, the ATT_SRPT is a table in which search information
for both the AOTT and the AVTT has been written.
The AOTT_SRPT is a table in which search information
only for the AOTT has been written.
[0131] The search information is divided not into
AVTT search information and AOTT search information
but into ATT (the generic name for AOTT and AVTT)
search information (ATT_SRPT, explained later) and
AOTT search information (AOTT_SRPT, explained
later) in order to simplify the reproducing method for
various types of DVD players.

[0132] FIG. 17 shows the recorded contents of the 
audio manager information management table 
(AMGI_MAT) included in the audio manager information 
shown in FIG. 16. 

[0133] The audio manager information management
table (AMGI_MAT) includes an audio manager identifier
(AMG_ID), an audio manager end address (AMG_EA),
an audio manager information end address (AMGI_EA),
the version number (VERN) of the standard employed
by the optical disk (DVD audio disk) 10, a volume setting
identifier (VLMS_ID), autoplay information (AP_INF), an
audio still video set start address (ASVS_SA), the
number of title sets (TS_Ns), a provider (software producer
and/or seller) identifier (PVR_ID), an audio manager
information management table end address
(AMGI_MAT_EA), an audio manager menu video object
set start address (AMGM_VOBS_SA), an audio title
search pointer table start address (ATT_SRPT_SA), an
audio-only title search pointer table start address
(AOTT_SRPT_SA), an audio manager menu program
chain information unit table start address
(AMGM_PGCI_UT_SA), an audio text data manager
start address (ATXTDT_MG_SA), a video attribute
(AMGM_V_ATR) for an audio manager menu video
object set, the number of sub-picture streams
(AMGM_SPST_Ns) for an audio manager menu, an
attribute (AMGM_SPST_ATR) for sub-pictures of an
audio manager menu video object set, the number of
audio streams (AMGM_AST_Ns) for an audio manager
menu, an audio attribute (AMGM_AST_ATR) for an
audio manager menu video object set, and other reserved
areas.

[0134] In the audio manager menu video object set
start address (AMGM_VOBS_SA), the start address of
the AMGM_VOBS is written as the relative number of
blocks counted from the first logical block in the AMG.
When no AMGM_VOBS is present, "00000000 (h)" is
written in the AMGM_VOBS_SA.
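How a player might interpret the AMGM_VOBS_SA field can be sketched in Python as follows. The function name, the parameter `amg_start_block`, and the logical block size of 2048 bytes are assumptions made for illustration (the text gives 2048 bytes only as the pack size); the relative addressing and the all-zero value for an absent AMGM_VOBS come from the text.

```python
LOGICAL_BLOCK_SIZE = 2048  # bytes; an assumption matching the 2048-byte pack size


def amgm_vobs_offset(amg_start_block, amgm_vobs_sa):
    """Return the byte offset of the AMGM_VOBS, or None when it is absent."""
    if amgm_vobs_sa == 0x00000000:
        return None  # the all-zero value marks a missing AMGM_VOBS
    # The field counts blocks relative to the first logical block of the AMG.
    return (amg_start_block + amgm_vobs_sa) * LOGICAL_BLOCK_SIZE
```

Returning `None` rather than an offset of zero keeps the "no menu present" case distinct from a valid address.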
[0135] In the start address (ATT_SRPT_SA), the start
address of the ATT_SRPT is written as the relative
number of blocks counted from the first logical block in
the AMGI.

[0136] In the start address (AOTT_SRPT_SA), the
start address of the AOTT_SRPT is written as the relative
number of blocks counted from the first logical block
in the AMGI.

[0137] From the ATT_SRPT_SA or AOTT_SRPT_SA
written in the AMGI_MAT of FIG. 17, it can be found on
which part of the optical disk 10 the search pointer to
the audio title (ATT_SRPT) or the search pointer to the
audio-only title (AOTT_SRPT) has been written.
[0138] FIG. 18 is a diagram to help explain the contents
of the audio title search pointer table (ATT_SRPT)
included in the audio manager information of FIG. 16.
The AMGI has two types of search pointers
(ATT_SRPT) and (AOTT_SRPT). FIG. 18 shows a
search pointer (ATT_SRP) that can access not only
AOTT but also AVTT.

[0139] Specifically, the ATT_SRPT included in the
AMGI includes audio title search pointer table information
(ATT_SRPTI) and one or more audio title search
pointers [ATT_SRP (ATT_SRP #1 to ATT_SRP #n)].
The ATT_SRPTI includes the number of audio title
search pointers and the ATT_SRPT end address.

[0140] FIG. 19 is a diagram to help explain the contents
of each audio title search pointer (here, ATT_SRP
#n) included in the search pointer table (ATT_SRPT) of 
the audio title shown in FIG. 18. 

[0141] The DVD audio standard has been determined
so as to deal with not only sound but also images. The
AMG has two pieces of search information (ATT_SRPT)
and (AOTT_SRPT). The ATT_SRPT of FIG. 19 is a
table in which both AOTT search information and AVTT
search information have been written.

[0142] In FIG. 19, the audio title search pointer
(ATT_SRP) for an audio-only title (AOTT) includes an audio title
(ATT) category, the number of programs in the audio
title (ATT), a reserved area, the total playback time of the
audio title (ATT), the number of the audio title set (ATS),
the title number of the audio title set (ATS), and the start
address of the audio title set (ATS).
[0143] The search pointer (ATT_SRP) for a video-added
audio title (AVTT) includes an audio title (ATT) category,
the number of programs in the audio title (ATT),
the number of angles included in video, a reserved area, the
total playback time of the audio title (ATT), the number
of the video title set (VTS), the title number of the video
title set (VTS), and the start address of the video title set
(VTS).
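The two forms of the audio title search pointer listed in paragraphs [0142] and [0143] can be modeled in Python as one record type. The field names, the types, and the use of a missing angle count to tell the two forms apart are assumptions of this sketch; the text lists the fields but not their sizes, units, or encoding.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AttSrp:
    category: int                 # audio title (ATT) category
    program_count: int            # number of programs in the ATT
    playback_time: int            # total playback time (unit is an assumption)
    title_set_number: int         # ATS number (AOTT) or VTS number (AVTT)
    title_number: int             # title number inside that title set
    title_set_start: int          # start address of the ATS or VTS
    angle_count: Optional[int] = None   # present only for an AVTT

    @property
    def is_avtt(self) -> bool:
        # In this sketch only the video-added form carries a number of angles.
        return self.angle_count is not None
```

The two forms differ only in the angle count and in whether the title set fields refer to an ATS or a VTS, which is why a single record type is enough here.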

[0144] FIG. 20 is a diagram to help explain the contents
of the audio-only title search pointer table
(AOTT_SRPT) included in the audio manager information
(AMGI) shown in FIG. 16. The AMGI has two types
of search pointers (ATT_SRPT) and (AOTT_SRPT).
FIG. 20 shows a search pointer (AOTT_SRP) that can
access only the AOTT.

[0145] Specifically, the AOTT_SRPT included in the
AMGI includes audio-only title search pointer table information
(AOTT_SRPTI) and one or more audio-only title
search pointers [AOTT_SRP (AOTT_SRP #1 to
AOTT_SRP #m)]. The AOTT_SRPTI includes the
number of audio-only title search pointers and the end
address of the AOTT_SRPT.
[0146] FIG. 21 is a diagram to help explain the contents
of an audio-only title search pointer (here,
AOTT_SRP #m) included in the search pointer table
(AOTT_SRPT) of the audio-only title shown in FIG. 20.
[0147] The DVD audio standard has been determined
so as to deal with not only sound but also images. The 
AMG has two pieces of search information (ATT_SRPT) 
and (AOTT_SRPT). The AOTT_SRPT of FIG. 21 is a 
table in which only AOTT search information has been 
written. 

[0148] In FIG. 21, the audio-only title search pointer
(AOTT_SRP) includes an audio title (ATT) category, the
number of programs in the audio-only title (AOTT), a reserved
area, the total playback time of the audio-only title
(AOTT), the number of the audio title set (ATS), the title
number of the audio title set (ATS), and the start
address of the audio title set (ATS).
[0149] In the playback title control information defined 
in the audio manager (AMG), a title group (TT_GR) can
be specified. 

[0150] The title group (TT_GR) is a collection of one 
or more audio titles (ATT) and is defined as a unit that 
assures continuous playback of ATT groups. From the 
user's viewpoint, the audio title (ATT) corresponds to a 
piece of music and the title group (TT_GR) corresponds 
to an album, a collection of pieces (see FIG. 8). With a 
record or a CD, when playback is started at the head of 
the album or in the middle of a piece, the album can be 
played back continuously until the end of the album has 
been reached. Similarly, when playback is started at the 
head of the TT_GR or in the middle of the ATT, the 
TT_GR can be played back continuously until the end of 
the TT_GR has been reached. 
[0151] The following two types can be defined as a 
title group (TT_GR). 

<A1> Audio title group (ATT_GR): this ATT_GR is a
title group (TT_GR) composed of audio titles (ATT)
defined in the audio title search pointer table
(ATT_SRPT).

<A2> Audio-only title group (AOTT_GR): this
AOTT_GR is a title group (TT_GR) composed of
audio-only titles (AOTT) defined in the audio-only
title search pointer table (AOTT_SRPT).

[0152] The audio title group (ATT_GR) is for a player
that can reproduce images and sound complying with 
the audio standard (a player that handles both AOTT 
and AVTT). The audio-only title group (AOTT_GR) is for 
a player that can reproduce only sound conforming to 
the audio standard (a player that deals with only AOTT). 
[0153] The structure of the audio title (ATT) has the 
following three types: 

<B1> ATT has only AOTT.
<B2> ATT has only AVTT.
<B3> ATT has both AOTT and AVTT.

[0154] Here, the AOTT and the AVTT contain the same piece
of music, but the AOTT is a pictureless version and the AVTT
is a picture-added version.

[0155] In the case of <B1>, the AOTT search information
is written in both of the ATT_SRPT and
AOTT_SRPT (see FIGS. 19 and 21).
[0156] In the case of <B2>, the AVTT search information
is written only in the ATT_SRPT (see FIG. 19).
[0157] In the case of <B3>, the AOTT search information
is written only in the AOTT_SRPT and the AVTT
search information is written only in the ATT_SRPT (see
FIG. 19).
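The three cases <B1> to <B3> can be written as a small Python routine that decides which table receives which search information. The function `register` and the tuple layout of the table entries are hypothetical; the placement rules themselves follow paragraphs [0155] to [0157].

```python
def register(title, has_aott, has_avtt, att_srpt, aott_srpt):
    """Append search information for one audio title to the two tables."""
    if has_aott and not has_avtt:        # case <B1>: AOTT only
        att_srpt.append((title, "AOTT"))
        aott_srpt.append((title, "AOTT"))
    elif has_avtt and not has_aott:      # case <B2>: AVTT only
        att_srpt.append((title, "AVTT"))
    elif has_aott and has_avtt:          # case <B3>: both versions
        att_srpt.append((title, "AVTT"))
        aott_srpt.append((title, "AOTT"))
    else:
        raise ValueError("an audio title needs an AOTT, an AVTT, or both")
```

Under these rules every title appears exactly once in the ATT_SRPT, while the AOTT_SRPT lists only titles that have an audio-only version, so a sound-only player can work from the AOTT_SRPT alone.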

[0158] FIG. 22 shows the relationship between <B1>,
<B2>, and <B3>. Specifically, it shows the relationship
between the audio-only title group (AOTT_GR)
accessed using the audio-only title search pointer
(AOTT_SRP) in the audio manager information (AMGI)
of FIG. 16 and the audio title group (ATT_GR) accessed
using the audio title search pointer (ATT_SRP) in the
audio manager information (AMGI). It can be said that
FIG. 22 shows an example of the relationship between
ATT_SRPT and AOTT_SRPT.

[0159] In FIG. 22, the audio titles ATT #1 and ATT #9
are each composed of only video-added audio titles
(AVTT). ATT #2 and ATT #8 are each composed of
video-added audio titles (AVTT) and audio-only titles
(AOTT). ATT #3 to ATT #7 are each composed of
audio-only titles (AOTT).

[0160] In FIG. 22, nine audio titles (ATT) are used.
These are divided into four groups (GR #1 to GR #4),
which constitute the audio title groups (ATT_GR). The
nine titles are also divided into two groups (GR #1, GR #2),
which constitute the audio-only title groups (AOTT_GR).

[0161] In this example, the audio titles ATT #1 and
ATT #9 are composed of only AVTT and include no
AOTT. Consequently, ATT #1 and ATT #9 do not belong to
any audio-only title group (AOTT_GR).
[0162] Therefore, the number (four in the example) of
audio title groups (ATT_GR) generally does not coincide
with the number (two in the example) of audio-only title
groups (AOTT_GR).

[0163] What is important here is to keep the identity of
the title group (TT_GR) both when ATT groups
are reproduced on a player capable of reproducing
images and sound complying with the audio standard
(a player that deals with both AOTT and AVTT) and
when ATT groups are reproduced on a player capable of
reproducing only sound conforming to the audio standard
(a player that deals with only AOTT).

[0164] Specifically, the corresponding ATT_GR and
AOTT_GR have to be composed of the same ATT, even
when they differ in GR number, and have to have the same
order of ATT in the TT_GR. Otherwise, the user gets
confused. This does not apply to the ATT (ATT #1 and
ATT #9 in FIG. 22) where only AVTT is present and no
AOTT exists.

[0165] To meet the above-described requirements,
restrictions should be placed in such a manner that
"ATT not defined as AOTT" is prevented from mixing
with "ATT defined as AOTT" in one ATT_GR. This
maintains the identity of TT_GR in a portion where both
ATT_GR and AOTT_GR exist.
[0166] In the example of FIG. 22, ATT_GR #2
and AOTT_GR #1, and likewise ATT_GR #3 and AOTT_GR #2,
are each composed of the same ATT and have the same order
of ATT in the TT_GR.

[0167] FIG. 23 is a diagram to help explain the
recorded contents of the audio title set (ATS) in the DVD
audio zone 71 of FIG. 3.

[0168] The audio title set (ATS) is composed of audio
title set information (ATSI), an audio-only title audio
object set (AOTT_AOBS), and audio title set information
backup (ATSI_BUP).

[0169] The audio title set information (ATSI) includes 
an audio title set information management table 
(ATSI_MAT) and an audio title set program chain infor- 
mation table (ATS_PGCIT). 

[0170] The audio title set program chain information 
table (ATS_PGCIT) includes audio title set program 
chain information table information (ATS_PGCITI), an 
audio title set program chain information search pointer 
(ATS_PGCI_SRP), and one or more pieces of audio
title set program chain information (ATS_PGCI). 
[0171] FIG. 24 is a diagram to help explain the 
recorded contents of the audio title set information man- 
agement table (ATSI_MAT) of FIG. 23. 
[0172] Specifically, the audio title set information
management table (ATSI_MAT) includes an audio title set
identifier (ATS_ID), the end address of the audio title set
(ATS_EA), the end address of the audio title set
information (ATSI_EA), the version number (VERN) of the
employed audio standard, the end address of the
audio title set information management table
(ATSI_MAT_EA), the start address (VTS_SA) of the
video title set (VTS) for the audio-only title (AOTT), the start
address of the audio-only title audio object set
(AOTT_AOBS_SA) or the start address of the audio-only
title video object set (AOTT_VOBS_SA), the start
address of the audio title set program chain information
table (ATS_PGCIT_SA), the attributes for the audio-only
title audio object (AOTT_AOB_ATR) or the attributes for
the audio-only title video object (AOTT_VOB_ATR) #0
to #7, audio title set data mix coefficients
(ATS_DM_COEFT) #0 to #15, and other reserved
areas.

[0173] When the ATS has no AOTT_AOBS, the start
address of the VTS including the VTSTT_VOBS (see
FIG. 6) for AOTT is written in the start address
(VTS_SA) of the audio-only video title set. When the
ATS has AOTT_AOBS, "00000000h" is written in the
VTS_SA.

[0174] When the ATS has AOTT_AOBS, the start
address of AOTT_AOBS is written in the
AOTT_AOBS_SA as the relative number of logical
blocks counted from the first logical block in the ATS. On
the other hand, when the ATS has no AOTT_AOBS, the
start address of VTSTT_VOBS is written in the
AOTT_VOBS_SA as the relative number of blocks
counted from the first logical block in the VTS including
the VTSTT_VOBS used for the ATS.



[0175] In the ATS_PGCIT_SA, the start address of
ATS_PGCIT is written as the relative number of blocks
counted from the first logical block in the ATSI.
[0176] Eight of the aforementioned
AOTT_AOB_ATR or AOTT_VOB_ATR are provided,
from #0 to #7. When the ATS has AOTT_AOBS, the
attribute of the AOTT_AOB recorded in the ATS is written
in the AOTT_AOB_ATR. On the other hand, when
the ATS has no AOTT_AOBS, the attribute of the audio
stream in the VOB used in the AOTT_VOB in the ATS is
written in the AOTT_VOB_ATR. In the AOTT_AOB_ATR
or AOTT_VOB_ATR, the employed sampling frequency
(44 to 192 kHz) and the number of quantization bits (16
to 24 bits) are written.

[0177] The ATS_DM_COEFT indicates coefficients
used to mix down audio data with a multichannel
output (5.1-channel output) to a two-channel output and
is used only for the one or more AOTT_AOB recorded in the
ATS. When the ATS has no AOTT_AOBS, "0h" is written
in all the bits in each of the 16 ATS_DM_COEFT (#0 to
#15). The areas for the 16 ATS_DM_COEFT (#0 to #15)
are always provided.

[0178] FIG. 25 is a diagram to help explain the contents
of the audio title set program chain information
table (ATS_PGCIT) included in the audio title set
information (ATSI) shown in FIG. 23. The recording position
of the ATS_PGCIT is written in the ATS_PGCIT_SA in
the ATSI_MAT of FIG. 24.

[0179] As described earlier, the ATS_PGCIT includes
audio title set program chain information table information
(ATS_PGCITI), an audio title set program chain
information search pointer (ATS_PGCI_SRP), and
audio title set program chain information (ATS_PGCI).
[0180] The ATS_PGCI_SRP includes one or more
audio title set program chain information search pointers
(ATS_PGCI_SRP #1 to ATS_PGCI_SRP #j). The
ATS_PGCI includes as many pieces of audio title set
program chain information (ATS_PGCI #1 to ATS_PGCI
#j) as there are ATS_PGCI_SRP.
[0181] Each ATS_PGCI functions as navigation data
for controlling the reproduction of the audio title set
program chain (ATS_PGC).

[0182] Here, the ATS_PGC is a unit to define an
audio-only title (AOTT) and is composed of ATS_PGCI
and one or more cells (cells in the AOTT_AOBS or cells
in the AOTT_VOBS used as an object for the AOTT).
[0183] Each ATS_PGCI includes general information
on audio title set program chains (ATS_PGC_GI), an
audio title set program information table (ATS_PGIT),
an audio title set cell playback information table
(ATS_C_PBIT), and an audio title set audio still video
playback information table (ATS_ASV_PBIT).
[0184] The ATS_PGIT includes one or more pieces of
audio title set program information (ATS_PGI #1 to
ATS_PGI #k). The ATS_C_PBIT includes as many
pieces of audio title set cell playback information
(ATS_C_PBI #1 to ATS_C_PBI #k) as the ATS_PGI.
[0185] FIG. 26 is a table showing the contents of the
audio title set program information (ATS_PGI) shown in
FIG. 25. The ATS_PGI includes the contents of the
audio title set program (ATS_PG_CNT), the entry cell
number of ATS_PG (ATS_PG_EN_CN), the playback
start time of the first audio cell in ATS_PG
(FAC_ST_PTM), the playback time of ATS_PG
(ATS_PG_PB_TM), the pause time of ATS_PG
(ATS_PG_PA_TM), a reserved area for copy management
information, and other reserved areas.
[0186] The ATS_PG_CNT includes a description
showing the relationship between the preceding program
and the present program in terms of physical allocation,
a description showing the relationship between
the preceding program and the present program in
terms of playback time stamp, a description (ATRN)
showing the attribute of AOB or the attribute of the audio
stream in the VOB in the ATS_PG, and a description
(DM_COEFTN) showing a coefficient table number for
effecting the down-mixing of an AOB in the ATS_PG
(AOB_PG) having the number of AOTT_AOB_ATR or
AOTT_VOB_ATR defined in the ATSI_MAT by using the
number of ATS_DM_COEFT defined in the ATSI_MAT.
[0187] The ATS_PG_EN_CN includes a description of 
the first ATS cell number (1 to 255) constituting an 
ATS_PG. 

[0188] The FAC_ST_PTM includes a description of
the low-order 32 bits in the playback time stamp (or 
presentation time stamp PTS) written in the head audio 
packet in the first audio cell in the ATS_PG. 
[0189] The ATS_PG_PB_TM is a description of the 
total playback time of each cell in the ATS_PG. The total 
playback time (seconds) is obtained by dividing 
ATS_PG_PB_TM (32-bit data) by 90000. 
[0190] The ATS_PG_PA_TM is a description of the 
pause time able to be defined at the beginning of the 
ATS_PG. The pause time (seconds) is obtained by 
dividing ATS_PG_PA_TM (32-bit data) by 90000. 
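
The conversion described for ATS_PG_PB_TM and ATS_PG_PA_TM is a division of a 32-bit count by 90000. A minimal sketch follows; the function name ticks_to_seconds and the sample value are hypothetical and serve only to illustrate the arithmetic.

```python
TICKS_PER_SECOND = 90_000        # divisor stated in the text
MAX_FIELD_VALUE = 0xFFFFFFFF     # the fields are 32-bit data


def ticks_to_seconds(field_value: int) -> float:
    """Convert an ATS_PG_PB_TM or ATS_PG_PA_TM value to seconds."""
    if not 0 <= field_value <= MAX_FIELD_VALUE:
        raise ValueError("the field must fit in 32 bits")
    return field_value / TICKS_PER_SECOND


# Hypothetical example: a program stored with 16,200,000 ticks.
print(ticks_to_seconds(16_200_000))
```

With a 32-bit field, the largest value that can be expressed this way is roughly 13 hours.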
[0191] FIG. 27 is a table showing the contents of the 
audio title set cell playback information (ATS_C_PBI) 
shown in FIG. 25. The ATS_C_PBI includes the index 
number (ATS_C_IXN) of a cell in the audio title set, the 
type of ATS_C (ATS_C_TY), the start address of ATS_C 
(ATS_C_SA), the end address of ATS_C (ATS_C_EA), 
and other reservations. 

[0192] When the ATT has no AOBS, "01h" is written in
the ATS_C_IXN. When the ATT has AOBS, the contents
of ATS_C_IXN are as follows according to the contents
of the ATS_C.

[0193] * When ATS_C is a silent cell described earlier,
"00h" is written in ATS_C_IXN as the index number for
the ATS_C.

[0194] * When ATS_C is an audio cell described earlier,
one of "1" to "99" is written in ATS_C_IXN as the
index number for the ATS_C.

[0195] The index number of the first audio cell (the
one having the ATS_C with the lowest number excluding
the silent cell) is set at "1." A similar index number
may be allocated to one or more ATS_C in the ATS_PG.



[0196] When the ATT has no AOBS, "0" is written in all
the bits in the ATS_C_TY. On the other hand, when the
ATT has AOBS, the structure of the ATS_C
(ATS_C_COMP) and its usage (ATS_C_Usage) are
written in the ATS_C_TY.

[0197] Specifically, when the cell is an audio cell composed
only of audio data, "00b" is written in
ATS_C_COMP (2 bits).

[0198] When the cell is an audio cell composed of
audio data and real-time information, "01b" is written in
ATS_C_COMP (2 bits).

[0199] When the cell is a silent cell composed only of
silent audio data, "10b" is written in ATS_C_COMP (2
bits).

[0200] In the ATS_C_Usage, the data "0001b" indicating
such usage as "a spotlight section" for highlighting
(spotlighting) a specific portion of the audio manager
menu (AMGM) displayed is written.
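
The cell identification rules above, for the case where the ATT has AOBS, can be sketched as follows. This is an illustration only: the names CellComp and describe_cell are hypothetical, and the position of ATS_C_COMP inside ATS_C_TY is not modelled because the text does not give the bit layout.

```python
from enum import Enum


class CellComp(Enum):
    """2-bit ATS_C_COMP codes listed in the text."""
    AUDIO_ONLY = 0b00        # audio cell composed only of audio data
    AUDIO_WITH_RTI = 0b01    # audio cell with real-time information
    SILENT = 0b10            # silent cell composed only of silent audio data


def describe_cell(comp_bits: int, index_number: int) -> str:
    """Check ATS_C_COMP against ATS_C_IXN and return a short label."""
    # Raises ValueError for 0b11, which the text does not describe.
    comp = CellComp(comp_bits)
    if comp is CellComp.SILENT:
        if index_number != 0:
            raise ValueError("a silent cell carries index number 00h")
        return "silent cell"
    if not 1 <= index_number <= 99:
        raise ValueError("an audio cell carries an index number from 1 to 99")
    return f"audio cell, index {index_number}"


print(describe_cell(0b10, 0))
print(describe_cell(0b00, 1))
```

A player could use such a check to recognize a silent cell from the cell information alone, which is how the length of the silent period is meant to be obtained.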
[0201] When the ATS has AOTT_AOBS, the start
address of ATS_C expressed in the relative logical block
number counted from the first logical block in the
AOTT_AOBS in which ATS_C has been recorded is
written in the ATS_C_SA.

[0202] On the other hand, when the ATS has no
AOTT_AOBS, the start address of ATS_C expressed in
the relative logical block number counted from the first
logical block in the AOTT_VOBS in which ATS_C has
been recorded is written in the ATS_C_SA.
[0203] When the ATS has AOTT_AOBS, the end
address of ATS_C expressed in the relative logical block
number counted from the first logical block in the
AOTT_AOBS in which ATS_C has been recorded is
written in the ATS_C_EA.

[0204] On the other hand, when the ATS has no
AOTT_AOBS, the end address of ATS_C expressed in
the relative logical block number counted from the first
logical block in the VTSTT_VOBS in which ATS_C has
been recorded is written in the ATS_C_EA.
[0205] FIG. 28 is a diagram showing the contents
of the audio title set audio still video playback
information table (ATS_ASV_PBIT) shown in FIG.
25. The ATS_ASV_PBIT includes audio title set
program audio still video playback information
search pointers (ATS_PG_ASV_PBI_SRP #1 to
ATS_PG_ASV_PBI_SRP #m) and audio title set audio
still video playback information (ATS_ASV_PBI #1 to
ATS_ASV_PBI #n). Here, n and m satisfy the expression
n ≤ m ≤ 99.

[0206] FIG. 29 is a table showing the contents of the
audio title set program audio still video playback
information search pointers (ATS_PG_ASV_PBI_SRP). The
ATS_PG_ASV_PBI_SRP includes the number
(ASVUN) of audio still video units (ASVU), the display
mode (ASV_DMOD) of audio still video (ASV), the start
address (ATS_ASV_PBI_SA) of audio title set audio still
video playback information (ATS_ASV_PBI), and the
end address (ATS_ASV_PBI_EA) of audio title set
audio still video playback information (ATS_ASV_PBI).
[0207] The ATS_ASV_PBI includes display lists
(ASV_DLIST #1 to ASV_DLIST #k: k ≤ 99) for audio still
video (ASV).

[0208] FIG. 30 is a block diagram of an apparatus (a 
DVD player) for reproducing the recorded information in 
the DVD audio zone 71 of FIG. 3 or the recorded infor- 
mation in the DVD video zone 72 of FIG. 5 from the opti- 
cal disk (DVD audio disk) 10 of FIG. 1. The reproducing 
apparatus has the configuration of a DVD video/DVD 
audio compatible player capable of performing not only 
audio playback but also video playback. Explanation of 
the concrete configuration will not be given. The player 
may be compatible with the current CD playback. 
[0209] The reproducing apparatus for the optical disk
10 shown in FIG. 30 includes a remote controller 5 that
receives the user's operations, a remote controller
receiving section 4A that receives the operating state of the
remote controller 5, a key input section 4 that receives
the user's operations on the reproducing apparatus
body side, and a panel display section 4B that is provided
on the reproducing apparatus body (and/or the
remote controller 5) and informs the user of the result of
the user's operation and the playback state of the optical
disk 10. The other external units include a monitor
section 6 and a speaker section 8L/8R. The speaker
section is for two-channel stereo. To effect multichannel
playback, as many speaker systems and driving amplifiers
for the speakers as are needed for multichannel
playback must be prepared.

[0210] The key input section 4, panel display section
4B, remote controller 5, and monitor section 6 constitute
a visual user interface. The monitor section 6 is used
not only as a playback image monitor for still-picture-added
DVD audio disks, but also as display means,
such as an on-screen display (OSD). The monitor section
6 is not limited to a direct-view display, such as a
CRT display, a liquid-crystal display, or a plasma display,
and may be a video projector that projects various
images (such as a menu screen, still pictures showing
the state of the recording spot, and others) including the
OSD information on a large screen.
[0211] Information on the user's operations from the
remote controller 5 is sent via a remote controller
receiving section 4A to the microcomputer (MPU or
CPU) 500 of a system control section 50 that controls
the operation of the entire reproducing apparatus. The
control section 50 includes a ROM 502 in which a control
program and others to be executed by the MPU 500
have been stored.

[0212] Information on the user's operations from the
key input section 4 is sent directly to the MPU 500.
According to the information on the user's operations, the
MPU 500 displays the operating state of the reproducing
apparatus (various setting states and playback information
on the DVD disk) on the panel display section 4B.
[0213] A RAM 52 and a memory interface (memory
I/F) 53 are connected to the MPU 500. The input/output
control of the RAM 52 is carried out via the memory I/F
53. The MPU 500 uses the RAM 52 as a work area. On
the basis of various processing programs stored in the
ROM 502, the MPU 500 controls the operations of a
disk drive section 30, a system processor section 54, a
video decoder section 58, an audio decoder section 60,
a sub-picture decoder section 62, and a D/A converting
& reproducing section 64.

[0214] The disk drive section 30 not only rotates the
optical disk 10 set in the tray (the inside of the DISK
TRAY INLET of FIG. 31) of the reproducing apparatus
body, but also reads the recorded data (audio data
including voice/music information and, if recorded on
the optical disk 10, main picture data/video data including
moving picture information/still picture information,
and sub-picture data including subtitle information/menu
information) from the optical disk 10. The disk
drive section 30 subjects the read-out data to signal
processes, including signal demodulation and error correction,
and converts the processed data into data
strings in pack form (see FIGS. 4 and 6). The resulting
data is sent to the system processor section 54.
[0215] The system processor section 54 has a packet
transferring section (not shown) that judges the types of
various packets included in the data reproduced from
the optical disk 10 and delivers the data items in the
packet to the corresponding one of the decoders (58,
60, 62).

[0216] The packet transferring section segments the
pack-form data string from the disk drive section 30 by
the type of pack (such as navigation pack, video pack,
sub-picture pack, audio pack, or real-time information
pack). A transfer time data item and an ID data item
indicating the type of data are recorded in each of the
segmented packs.

[0217] Referring to the transfer time data item and the
ID data item, the system processor section 54 transfers
video packs, sub-picture packs, and audio packs to the
video decoder section 58, sub-picture decoder section
62, and audio decoder section 60, respectively. An
audio pack or a real-time information pack corresponding
to a silent cell is sent to the audio decoder section
60.

[0218] The system processor section 54 transfers the
control data in the navigation pack to the RAM 52 via
the memory I/F 53. The MPU 500, referring to the transferred
control data in the RAM, controls the playback
operation in each section of the reproducing apparatus
body.
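
The routing performed by the packet transferring section can be sketched as a lookup from pack type to destination. This is a simplified illustration: the pack-type strings, destination labels, and the function route_packs are hypothetical, the transfer time data item is ignored, and real-time information packs are simply routed to the audio decoder.

```python
from collections import defaultdict

# Destination of each pack type, following the description in the text.
ROUTES = {
    "video": "video_decoder_58",
    "sub_picture": "sub_picture_decoder_62",
    "audio": "audio_decoder_60",
    "real_time_information": "audio_decoder_60",
    "navigation": "ram_52",   # control data, via the memory I/F 53
}


def route_packs(packs):
    """Group (pack_type, payload) pairs by destination, keeping order."""
    queues = defaultdict(list)
    for pack_type, payload in packs:
        try:
            destination = ROUTES[pack_type]
        except KeyError:
            raise ValueError(f"unknown pack type: {pack_type!r}") from None
        queues[destination].append(payload)
    return dict(queues)


stream = [
    ("navigation", b"n0"),
    ("video", b"v0"),
    ("audio", b"a0"),
    ("sub_picture", b"s0"),
    ("audio", b"a1"),
]
print(route_packs(stream))
```

Keeping the table separate from the loop mirrors the text, in which the ID data item alone decides where a pack is delivered.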

[0219] The video decoder section 58 decodes the
MPEG-encoded video data in the video pack transferred
from the system processor section 54 and creates
the uncompressed image data.
[0220] The sub-picture decoder section 62 decodes
the run-length-compressed sub-picture data in the
sub-picture pack transferred from the system processor
section 54 and creates the uncompressed bit map sub-picture
data. The sub-picture decoder section 62 includes
not only a sub-picture decoder for decoding the sub-picture
data from the system processor section 54 but also
a highlighting section (in the case of DVD video) for
decoded sub-picture data.

[0221] The sub-picture decoder expands the pixel
data (including highlighted pixels, pattern pixels, and
background pixels) in units of a specific number of bits
(two bits) run-length-compressed according to a specific
rule and restores the original bit map image.
[0222] The highlighting section performs a corresponding
highlighting process according to the X-Y
coordinate values indicating the rectangular area in
which the highlight information (e.g., the choices on a
menu) is displayed, the color codes, and the highlight
color/contrast values supplied from the MPU 500.

[0223] The highlighting process can be used as
means for enabling the user to easily recognize a
specific displayed item (the operator for selecting the type
of reproduced spoken language and the type of language
used in reproduced subtitles or the operator for
selecting a specific item, such as the sampling frequency
of the reproduced sound, the number of quantization
bits, or the number of playback channels) on a
visual user interface on the monitor section 6.
[0224] When the color and contrast of each pixel in the
decoded sub-picture data are changed according to the
highlight information, the changed sub-picture data is
supplied to the image combining section (not shown) of
a video processor section 640. The image combining
section combines the decoded image data with the
highlighted sub-picture data. The resulting image is displayed
on the monitor section 6.
[0225] The RAM 52 includes a menu table for storing
the start addresses of a sub-picture menu, an audio
menu, an angle menu, and a chapter (program) menu.
To highlight a specific part of these menus, the highlighting
process is used.

[0226] The audio decoder section 60 decodes the
audio data in the audio pack transferred from the system
processor section 54 and creates audio data for
two-channel stereo or multichannel stereo. When the
audio data in the audio pack is compression-encoded
data (such as MPEG or AC-3), the audio decoder section
60 also decodes the data.

[0227] The image data (normally, moving-picture signals)
decoded by the video decoder section 58 and the
sub-picture data (normally, the bit map data on subtitles
and menus) decoded by the sub-picture decoder section
62 are transferred to a video processor 640. The
video processor 640 mixes the image data and the
sub-picture data in a specific ratio to produce the final
analog image signals (composite video signals, separate
S signals, or component signals Y/Cr/Cb) and outputs
these signals to the monitor section 6.
[0228] When the image data decoded by the video
decoder section 58 is the main part of the movie on a
DVD video disk, the sub-picture data is usually the subtitles
in the language selected by the user. The monitor
section 6 displays the main part of the movie with subtitles.

[0229] When the image data decoded by the video 
decoder section 58 is the menu section of the movie, 
the sub-picture data serves as the characters constitut- 
ing menus and user select operators (subjected to the 
highlighting process, when necessary). In this case, the 
background (still picture or moving picture) of the menu 
is displayed according to the image data and the opera- 
tors whose representations are changed by the user's 
select operation are displayed on the background 
screen according to the sub-picture data. 
[0230] On the other hand, when the image data 
decoded by the video decoder section 58 is a still pic- 
ture on a DVD audio disk, the sub-picture data is, for 
example, an explanatory text in the language selected 
by the user. In that case, the still picture with text is dis- 
played on the monitor section 6. 
[0231] The video processor section 640 includes an 
OSD section that generates display data for on-screen 
display. The user's operations from the remote control- 
ler 5 or the like are processed by the MPU 500. The 
result of the processing is sent from the MPU 500 to the 
OSD section of the video processor 640. The OSD sec- 
tion generates image data corresponding to the result of 
processing from the MPU 500 and sends the image 
data to the monitor section 6 in analog image signal 
form. 

[0232] In other words, the video processor section 640 
converts the digital signals from the video decoder sec- 
tion 58 and sub-picture decoder section 62 into analog 
signals and multiplexes them. 

[0233] A frame memory section 642 is connected to 
the video processor section 640. The frame memory 
section 642 is used not only to multiplex the pictures of 
the image data and the pictures of the sub-picture data 
but also to provide an n-partition (e.g., 4-partition)
multi-screen display.

[0234] When chapter searching is done, the frame 
memory section 642 can fix part of the images from the 
video decoder section 58 as still pictures and use them 
when sending the still pictures to the monitor section 6 
until the target chapter starts to be reproduced. 
[0235] When the display corresponding to the result of 
the user's operation is made by the OSD, the frame
memory section 642 can be used in multiplexing the 
image data with the OSD display. 
[0236] The audio data decoded at the audio decoder 
section 60 is transferred to a DAC & output circuit 644. 
The DAC & output circuit 644 converts the audio data 
(digital) from the audio decoder section 60 into the cor- 
responding analog audio signal, amplifies the analog 
audio signal suitably, and outputs it to the speaker
section 8L/8R.

[0237] When the multichannel audio is down-mixed to
two channels on the basis of the contents of
ATS_DM_COEFT in the audio title set information
management table (ATSI_MAT) shown in FIG. 24, the MPU
500 sends the down-mix coefficient (parameter) to the
DAC & output circuit 644. Then, on the basis of the coefficient
received, the DAC & output circuit 644 down-mixes
the multichannel audio data decoded at the audio
decoder section 60 into two channels and outputs
two-channel analog audio signals.
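
A down-mix of this kind is commonly a weighted sum of the input channels for each output channel. The sketch below illustrates that idea only: the channel order, the coefficient values, and the function downmix_frame are assumptions, since the text does not describe how the coefficients are encoded in ATS_DM_COEFT or how the circuit 644 applies them.

```python
def downmix_frame(samples, left_coeffs, right_coeffs):
    """Mix one multichannel sample frame down to a (left, right) pair."""
    if not (len(samples) == len(left_coeffs) == len(right_coeffs)):
        raise ValueError("one coefficient per input channel is required")
    left = sum(c * s for c, s in zip(left_coeffs, samples))
    right = sum(c * s for c, s in zip(right_coeffs, samples))
    return left, right


# Assumed channel order: L, R, C, LFE, Ls, Rs (not taken from the text).
# Assumed coefficient values, chosen only to make the arithmetic easy.
LEFT = (1.0, 0.0, 0.5, 0.0, 0.5, 0.0)
RIGHT = (0.0, 1.0, 0.5, 0.0, 0.0, 0.5)

frame = (1.0, 2.0, 4.0, 0.0, 8.0, 16.0)
print(downmix_frame(frame, LEFT, RIGHT))
```

Storing one coefficient table per ATS, selected per program by number, would let the same decoder serve discs mastered with different mixes, which appears to be the purpose of the 16 ATS_DM_COEFT areas.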
[0238] The video processor section 640, frame memory
section 642, and DAC & output circuit 644 constitute
the D/A converting/reproducing section 64.
[0239] Each of the system processor section 54, video
decoder section 58, audio decoder section 60, and
sub-picture decoder section 62 includes a register for temporarily
storing a system time clock (STC) used to know
the operation timing and the instructions and pieces of
information from the system MPU 500.
[0240] FIG. 31 shows an example of the front panel of
the reproducing apparatus shown in FIG. 30. The front
panel is provided with a fluorescent display section (FL
display) 4B corresponding to the panel display section
4B shown in FIG. 30.

[0241] On the FL display 4B of FIG. 31, an album
name and/or a group name are displayed in characters
according to the audio text data manager ATXTDT_MG
in the AMGI. Taking FIG. 8 as an example, "THE FIRST
VOLUME OF WORKS BY BEETHOVEN" is displayed
as an album name and "SYMPHONY NO. 1" is displayed
as a group name.

[0242] On the left numeral display section of the FL
display 4B, a title number (in the case of DVD video) or
group number (in the case of DVD audio), a track
number, and an index number are displayed.
[0243] When the optical disk 10 set in the disk tray of
FIG. 31 is an AV disk (a disk with ATT_SRP of FIG. 19),
the "AV DISK" part is highlighted as shown in the figure
on the character display section somewhat in the middle
and on the right side of the FL display 4B. When the
optical disk 10 set is an A disk (a disk with AOTT_SRP
of FIG. 21), the "A DISK" part is highlighted on the right
character display section of the FL display 4B. When the
optical disk 10 set is a video disk with only VTS and no
ATS (a disk without the ATS directory of FIG. 11), the
"VIDEO DISK" part is highlighted on the right character
display section of the FL display 4B.
[0244] Furthermore, on the right numeral display section
of the FL display 4B, the sampling frequency and
the number of quantization bits in the audio contents to
be reproduced are displayed. The display can be made
automatically on the basis of AOTT_AOB_ATR or
AOTT_VOB_ATR in the audio title set information
management table ATSI_MAT.

[0245] The following two types of DVD audio player
that plays back a DVD audio disk (A disk or AV disk) can
be considered:

<C1> Player capable of reproducing images and
sound complying with the audio standard.
<C2> Player capable of reproducing only sound
complying with the audio standard.



[0246] The <C1>-type player has only to read the
search information (FIG. 19) written in the ATT_SRPT
for playback of contents.

[0247] On the other hand, the <C2>-type player has
only to read the search information (FIG. 21) written in
the AOTT_SRPT for playback of contents.
[0248] By doing this, the reproducing method on each
type of player is simplified. Note that the <C2>-type
player cannot reproduce ATT #1 and ATT #9 of FIG. 22,
because they have no AOTT.
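
The way each player type sees the titles of FIG. 22 can be sketched by building the two search pointer tables according to cases <B1> to <B3> and letting each player read only its own table. The dictionary layout and the names build_search_tables and titles_for_player are hypothetical; real tables hold search information, not the simple labels used here.

```python
# Title make-up as described for FIG. 22.
TITLES = {
    1: {"AVTT"},
    2: {"AOTT", "AVTT"},
    3: {"AOTT"}, 4: {"AOTT"}, 5: {"AOTT"}, 6: {"AOTT"}, 7: {"AOTT"},
    8: {"AOTT", "AVTT"},
    9: {"AVTT"},
}


def build_search_tables(titles):
    """Return (ATT_SRPT, AOTT_SRPT) as {title number: version listed}."""
    att_srpt, aott_srpt = {}, {}
    for number, versions in sorted(titles.items()):
        # ATT_SRPT lists the AVTT when one exists (<B2>, <B3>),
        # otherwise the AOTT (<B1>).
        att_srpt[number] = "AVTT" if "AVTT" in versions else "AOTT"
        # AOTT_SRPT lists only titles that have an AOTT (<B1>, <B3>).
        if "AOTT" in versions:
            aott_srpt[number] = "AOTT"
    return att_srpt, aott_srpt


def titles_for_player(titles, player_type):
    """<C1> players read ATT_SRPT; <C2> players read AOTT_SRPT."""
    att_srpt, aott_srpt = build_search_tables(titles)
    if player_type == "C1":
        return att_srpt
    if player_type == "C2":
        return aott_srpt
    raise ValueError(f"unknown player type: {player_type!r}")


print(sorted(titles_for_player(TITLES, "C1")))
print(sorted(titles_for_player(TITLES, "C2")))
```

Because each player consults a single table, neither type needs logic to skip titles it cannot reproduce.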

[0249] The reproducing apparatus of FIG. 30 is a
player of the <C1> type. The operation of the player
playing back the optical disk 10 with the data structure 
of FIG. 13 will be explained. 

[0250] When the optical disk 10 with the data structure
of FIG. 13 is played back on an ordinary DVD video
player, the video player reads the VMG in the VTS directory
under the root directory of FIG. 11 and, on the basis
of the information, determines the title to be reproduced.
Then, according to the instruction given by the
playback unit defined in the VTS corresponding to the
determined title, all of or part of the object set (VOBS #1
or VOBS #2) of FIG. 13 is reproduced.
[0251] In the data structure of FIG. 13, the video
player recognizes the parts excluding VMG, VTS #1,
and VTS #2 as the other recording areas 73 (see FIGS.
3 and 5). Therefore, no matter what type of data has
been written in the parts recognized as the other recording
areas 73, this has no effect on the video player
reproducing VOBS #1 and VOBS #2. In this case, the
video player cannot reproduce the objects present in the
other recording areas 73.

[0252] On the other hand, when the optical disk 10
with the data structure of FIG. 13 is played back on the
DVD audio player of FIG. 30, the audio player reads the
AMG in the ATS directory under the root directory of
FIG. 11 and reproduces the contents on the basis of the
information. In title specification using AMG, not only
the playback unit defined in the ATS recorded in the
DVD audio zone 71 (see FIG. 3) but also the playback
unit defined in the VTS recorded in the DVD video zone
72 (see FIG. 5) can be specified.
[0253] The playback unit defined in the ATS can specify
not only the playback route of the object (AOBS #1 or
AOBS #2) recorded in the DVD audio zone 71 but also
the playback route of the audio data recorded in the
object (e.g., VOBS #1) in the DVD video zone 72.
[0254] The VOBS #1 marked with slanted lines in FIG. 
13 indicates part of the DVD video shared by the DVD 
audio side. Here, the arrow (A) indicates the case where 
the playback unit in the DVD video zone 72 is referred 
to. The arrow (B) indicates the case where the playback 
unit in the DVD audio zone 71 refers to the audio part of 
the object (VOBS #1) in the DVD video zone 72. 
[0255] When the audio part of the object (VOBS #1) in
the DVD video zone 72 is referred to by the playback
unit in the DVD audio zone 71, the common reference
part (the part shared by the DVD audio and DVD video)
can be defined differently from each unit (such as cells,
programs, or program chains) defined by the definition
information (VTSI) in the playback unit in the DVD video
zone 72, on the basis of the definition information (ATSI)
in the playback unit. This is because the video player
may differ from the audio player in the reproducing
method, although the objects are the same (see FIG. 7).
[0256] The shared part is handled in units of video
object units (VOBU). The reason is that the VOBU is the
unit in which an audio data stream and the other (video
and sub-picture) data streams are each packed and
time-division-multiplexed.

[0257] As shown in FIG. 13, by placing the DVD audio
zone 71 physically in front of the DVD video zone 72, all
the addresses of the playback units specified in the
individual pieces of the management information can be
limited to only ascending address specification. This
simplifies the design and development of audio players.
[0258] The operation of a video player in the data
structure of FIG. 15 is the same as in FIG. 13. The
operation of an audio player in the data structure of
FIG. 15 is almost the same as in FIG. 13. The audio player
jumps to the head of the AMG, reads the management
information, and reproduces the audio object sets
(AOBS #1 and AOBS #2). The AOBS #1 is an object in
the DVD video zone 72. Using ATSI #1, cells, programs,
and program chains in AOBS #1 are defined again. In
AOBS #1, VOBU is used as a unit.
[0259] In the above embodiment, the case where the
DVD audio data and/or DVD video data included in the
volume space is recorded on the optical disk 10 has
been explained. However, the data structure of the
present invention (see FIGS. 3 to 29) is not limited to the
case where the data is recorded on the optical disk 10.
For instance, bit streams including the data having the
structures shown in FIGS. 3 and 11 may be used in
digital broadcasting or digital communications. In this
case, electromagnetic waves or communication lines
function as the media. Moreover, communication
terminals, such as DVD broadcasting receivers or
personal computers, function as DVD audio players.

[0260] Systems to which the present invention is
applicable have been explained in general terms. The
points that the invention emphasizes will now be
explained in order.
[0261] The main point is that the cell structure in DVD
audio is given a distinctive feature. First, there are the
following two types of DVD audio, depending on the type
of data dealt with:


[A-1] Audio with Video: a system that handles both 
audio data and video data. 

[A-2] Audio without Video: a system that handles 
only audio data and deals with no video data. 


[0262] The data structure of type [A-1] is basically the
same as that in the DVD standard. The present
invention concerns the audio data structure of type
[A-2]. The optical disk 10 for the DVD audio system has
the structure explained earlier. The whole of one side of
the optical disk 10 is defined as a volume. A title group
(TT_GR) is a component of a volume and is composed
of one or more audio titles (ATT). The TT_GR
corresponds to an album on a record or a CD. It is
assured that track groups in one TT_GR can be
reproduced continuously.

[0263] There are the following two types of ATT: 

[B-1] Audio with Video Title (AVTT): a title made up 
of audio data and video data. 
[B-2] Audio Only Title (AOTT): a title made up of 
only audio data. 

[0264] The AVTT and AOTT are generally called ATT. 
As described above, since the present invention deals 
with the data structure of type [A-2], explanation of 
AOTT will be given. One AOTT is made up of one PGC. 
More specifically, as shown in FIG. 3, one AOTT is com- 
posed of the program chain information (ATS_PGCI) in 
the ATS and one or more cells in the audio object set 
(AOBS) in the corresponding ATS. 
[0265] A track is a program (PG) defined in the PGC. 
One track is composed of one PG. The track is made up 
of one or more cells. 

[0266] Generally, in the audio contents, a track is used 
as a unit in separating pieces of music. A cell is used as 
a unit in separating the numbers in a piece of music. 
The playback of the audio contents is defined by speci- 
fying the playback sequence of cells. 
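
The hierarchy described in paragraphs [0264] to [0266] can be pictured as a small data model. The following Python sketch is illustrative only: the class names `Cell`, `Track`, and `AudioOnlyTitle` and the method `playback_sequence` are assumptions made for this example and do not appear in the specification.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Cell:
    """Smallest playback unit (hypothetical model)."""
    name: str


@dataclass
class Track:
    """One track, i.e. one program (PG), made up of one or more cells."""
    cells: List[Cell]

    def __post_init__(self) -> None:
        if not self.cells:
            raise ValueError("a track needs at least one cell")


@dataclass
class AudioOnlyTitle:
    """One AOTT, i.e. one program chain (PGC) listing its tracks in order."""
    tracks: List[Track]

    def playback_sequence(self) -> List[str]:
        # Playback is defined by the order of the cells, track by track.
        return [cell.name for track in self.tracks for cell in track.cells]


title = AudioOnlyTitle(tracks=[
    Track(cells=[Cell("cell-1"), Cell("cell-2")]),
    Track(cells=[Cell("cell-3")]),
])
print(title.playback_sequence())
```

Flattening the tracks into one ordered list of cells mirrors the statement that playback is defined by specifying the playback sequence of cells.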
[0267] The following specifications are required for the 
audio data structure of type [A-2]: 

[C-1] The attribute of audio data has to be able to 
be set track by track. 

[0268] About [C-1]: In a music CD, the attributes
(including the sampling frequency fs and the number of
quantization bits Qb) of each piece of music in one
album are all the same. In DVD audio, however,
attributes are allowed to be set piece by piece to
increase the degree of freedom of the sound source.
Specifically, the content provider can set attributes track
by track. The attributes for each track in DVD audio
include sampling frequency, the number of quantization
bits, channel assignment, and down-mix coefficient.
[0269] As described above, when the audio data
fulfilling the specification in item [C-1] is reproduced on a
DVD player, there arises a sound break problem at the
start of playback of tracks. From the viewpoint of the
contents, however, the sound break should be managed
by the producer. Moreover, as described above, the
length of a sound break should be the same regardless
of whether the player has a video reproducing function.
[0270] Therefore, according to the present invention,
there is provided a data structure which enables the
producer to set the length of the sound break time and
which enables a player to determine whether a sound
break is present in the playback procedure defined by
the producer and to realize the sound break time set by
the producer.
[0271] A sound break may occur in a place where the
attributes of two tracks making a playback transition
differ from each other. Specifically, when the player
reproduces a track with an attribute and the attribute of
the next track to be reproduced differs from the preceding
one, the player has to make various settings (including
the setting of buffers affected by the difference in the
number of quantization bits, the setting of the clock
(sampling) frequency, and the setting of the number of
channels). During the settings, the data transfer is
stopped and therefore a break in the sound takes place.
Naturally, the sound break does not occur when the
attributes of two tracks making a playback transition are
the same. Consequently, a sound break may or may not
take place in one title group (TT_GR).
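
The condition in paragraph [0271] amounts to comparing the attributes of each track with those of the preceding track. The sketch below shows one way to locate the transitions where a break can be expected. The class `TrackAttributes`, its field names, the integer encodings, and the function `break_positions` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class TrackAttributes:
    """Per-track attributes named in the text (field encodings are assumed)."""
    sampling_hz: int
    quantization_bits: int
    channel_assignment: int
    downmix_coefficient: int


def break_positions(tracks: List[TrackAttributes]) -> List[int]:
    """Indices of tracks whose attributes differ from the preceding track."""
    return [i for i in range(1, len(tracks)) if tracks[i] != tracks[i - 1]]


album = [
    TrackAttributes(48_000, 16, 1, 0),
    TrackAttributes(48_000, 16, 1, 0),   # same as before: no break expected
    TrackAttributes(96_000, 24, 1, 0),   # attributes change: break expected
]
print(break_positions(album))
```

For this three-track example only the transition into the third track is reported, which matches the observation that a title group may contain some transitions with a break and some without.
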
[0272] Since the sound break is ascribed to a physical
cause, it cannot be solved in terms of the application-
level data structure. Therefore, the system of the
present invention explicitly accepts the existence of the
sound break described above and constructs a data
structure that allows the content provider to manage
the sound break time length. The resulting form gives
no unnatural feeling to the user when it is reproduced.
[0273] The types of cells of audio data are defined as 
follows: 



[D-1] Audio cell (A_C): a cell composed of ordinary 
audio data. 

[D-2] Silent cell (SI_C): a cell composed of silent
audio data.

[0274] Then, data identifying information that identifies
the contents of cell components is added to the cell
information, thereby making it possible to discriminate
between the two types of cells. Here, silent means not
that audio data does not exist but that there is audio
data with an amplitude of zero.
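
Because a silent cell carries real audio data with an amplitude of zero, its payload size follows directly from its duration and the audio attributes. The sketch below computes such a payload for linear PCM. It assumes signed samples stored in whole bytes, so that zero amplitude is represented by all-zero bytes; the function name `silent_pcm` and its parameters are hypothetical, and pack headers and multiplexing are ignored.

```python
def silent_pcm(duration_s: float, sampling_hz: int, bits: int, channels: int) -> bytes:
    """Zero-amplitude linear PCM of the requested length (simplified sketch)."""
    if bits % 8 != 0:
        raise ValueError("this sketch only handles whole-byte sample sizes")
    samples_per_channel = round(duration_s * sampling_hz)
    # bytes(n) returns n zero bytes, i.e. signed samples with amplitude zero.
    return bytes(samples_per_channel * channels * (bits // 8))


payload = silent_pcm(0.5, 48_000, 16, 2)
print(len(payload))
```

For 0.5 second of 48 kHz, 16-bit, two-channel audio this gives 24,000 samples per channel, 2 channels, and 2 bytes per sample.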

[0275] FIG. 32 shows the classification of two types of
cells.

[0276] An audio cell (A_C) in an audio data cell
includes no still picture data. A silent cell (SI_C)
corresponds to a special case of an audio data cell. All the
audio data items in the silent cell are silent. The silent
cell is used to set and manage the length of time of
silence.

[0277] One ATS_PG is composed of one or more
ATS_C. The ATS_PG corresponds to a track and the
ATS_C corresponds to a cell.

[0278] FIGS. 33A and 33B show two ways in which
ATS_C are arranged in the ATS_PG.

[0279] Specifically, in [E-1] ATS_PG, only A_C are
arranged.

[0280] In [E-2] ATS_PG, the first cell is SI_C and the
second and later cells are all A_C arranged in
sequence.

[0281] All the ATS_C constituting one ATS_PG meet 
the following conditions: 

[F-1] All the ATS_C constituting one ATS_PG are 
physically consecutive in the arrangement.
[F-2] The presentation time stamps (PTS) in all the 
ATS_C constituting one ATS_PG are consecutive. 
[F-3] At least one A_C exists in one ATS_PG. 
[F-4] The presentation time of one A_C is one sec- 
ond or longer. 

[F-5] The audio attributes of all the SI_C constitut-
ing one ATS_PG and those of the A_C group are 
the same. 

[F-6] The presentation time of one SI_C is 0.5 sec 
or longer. 
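
The arrangements [E-1] and [E-2] and the conditions [F-3] to [F-6] lend themselves to a mechanical check. The following sketch validates one ATS_PG against them. The `Cell` class, its fields, and the function `check_program` are assumptions for illustration; conditions [F-1] and [F-2] (physical and PTS continuity) are not modeled here.

```python
from dataclasses import dataclass
from typing import List, Tuple

AUDIO = "A_C"
SILENT = "SI_C"


@dataclass
class Cell:
    kind: str                    # AUDIO or SILENT
    duration_s: float            # presentation time in seconds
    attributes: Tuple[int, int]  # (sampling frequency in Hz, quantization bits)


def check_program(cells: List[Cell]) -> List[str]:
    """Return the labels of violated rules for one ATS_PG (empty means valid)."""
    problems = []
    # [E-1]/[E-2]: a silent cell may appear only as the first cell.
    if any(c.kind == SILENT for c in cells[1:]):
        problems.append("E-2")
    # [F-3]: at least one audio cell must exist.
    if not any(c.kind == AUDIO for c in cells):
        problems.append("F-3")
    # [F-4]: each audio cell lasts one second or longer.
    if any(c.kind == AUDIO and c.duration_s < 1.0 for c in cells):
        problems.append("F-4")
    # [F-5]: silent and audio cells share the same audio attributes.
    if len({c.attributes for c in cells}) > 1:
        problems.append("F-5")
    # [F-6]: each silent cell lasts 0.5 second or longer.
    if any(c.kind == SILENT and c.duration_s < 0.5 for c in cells):
        problems.append("F-6")
    return problems


attrs = (96_000, 24)
good = [Cell(SILENT, 2.0, attrs), Cell(AUDIO, 180.0, attrs)]
bad = [Cell(AUDIO, 0.4, attrs), Cell(SILENT, 0.2, (48_000, 16))]
print(check_program(good))
print(check_program(bad))
```

Returning a list of rule labels rather than a single boolean makes it easier for an authoring tool to report every violation at once.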

[0282] FIGS. 34, 35, and 36 are diagrams to help
explain AOTT_AOB and AOTT_AOBS. In the figures,
A_PAK means an audio pack, and RTI_PAK means a
real-time information pack.

[0283] As described above, the target title is AOTT of 
the type described in item [B-2]. Therefore, an audio 
object, the substance of the data, is an Audio Object for 
Audio Only Title (AOTT_AOB). The AOTT_AOB is com-
posed of one or more ATS_C. Each ATS_C is com- 
posed of pack groups. 

[0284] The data included in the AOTT_AOB is audio
data. The audio data includes silent audio data (with an
amplitude level of zero as described earlier). As a
special case, it may also include a small amount of
additional non-image data (called real-time information
data), such as text data, in the form of RTI packs.
[0285] The AOTT_AOB has to include audio data. All 
the attributes of the audio data in one AOTT_AOB have 
to be the same. The still picture data is included option- 
ally in the AOTT_AOB. The still picture included in one 
program (PG) has to be outputted before the audio data 
in the program is reproduced. 

[0286] One AOTT_AOB is one program stream or part 
of the stream written according to the system part 
of the MPEG-2 standard (ISO/IEC 13818-1). The 
AOTT_AOBS is a collection of AOTT_AOB. As defined 
in [D-1] and [D-2], the following two types of cells are 
defined in one AOTT_AOB. 

[G-1] An audio cell (A_C) is composed of only audio
data pack groups (see FIG. 34) or of audio data
pack groups and additional non-image data (RTI
data) pack groups (see FIG. 35). Its presentation
time is one second or longer.
[G-2] A silent cell (SI_C) is composed of only silent
audio data pack groups (FIG. 36) and is used to set
a silent period. The presentation time for one SI_C
is 0.5 second or longer.

[0287] The following two cases can be considered in
connection with the relationship between two adjacent
PGs on the PGC:

[I-1] A PG and the preceding PG have the same
attributes.

[I-2] A PG and the preceding PG have different
attributes.

[0288] Each PG records the attributes of the audio data
in its cells and the PG information that defines the
temporal relationship with the preceding PG. By
recognizing these contents and the information on the
cell arrangement (playback sequence) in the PG, a
player can easily distinguish the two states in items
[I-1] and [I-2].

[0289] FIGS. 37A to 37C and FIGS. 38A to 38C show 
the relationship between PTS and playback time in two 
adjacent PGs in the two cases described in items [I-1]
and [I-2]. 

[0290] FIGS. 37A to 37C show the case in item [I-1].
FIGS. 37A, 37B, and 37C illustrate the arrangement
(PG) of audio packs on a track, the values of
presentation time stamps (PTS), and the passage of
playback time, respectively. In this case, the consecutive
transfer of audio data streams is maintained and the
continuity of audio playback is also maintained. The PTS
is reset at the beginning of the PG.

[0291] In this case, the first cell in the PG might be 
SI_C but the state remains unchanged. The reason is 
that SI_C is a kind of audio cell and corresponds to a 
special case where all the audio data items have an 
amplitude level of zero. 

[0292] FIGS. 38A to 38C show the case in item [I-2].
FIGS. 38A, 38B, and 38C illustrate the arrangement
(PG) of audio packs on a track, the values of
presentation time stamps (PTS), and the passage of
playback time, respectively. In this case, too, the
consecutive transfer of audio data streams is maintained.
Since hardware resetting is necessary in changing the
attributes, audio playback is discontinued during the
resetting.

[0293] In this case, the content producer can manage 
the intervals of silent time by setting the first cell in the 
PG to SI_C. Specifically, the content producer can set 
the intervals of silent time at will by setting the length of 
SI_C (0.5 sec or longer). 
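
The authoring step described in paragraph [0293], in which the producer fixes the silent time by putting an SI_C of a chosen length at the head of the PG, can be sketched as follows. The tuple representation of a cell and the function `set_leading_silence` are hypothetical; only the 0.5 second minimum comes from the text.

```python
from typing import List, Tuple

Cell = Tuple[str, float]   # (cell kind, presentation time in seconds)
MIN_SILENT_S = 0.5         # minimum SI_C length stated in the text


def set_leading_silence(program: List[Cell], silence_s: float) -> List[Cell]:
    """Return a copy of the program whose first cell is an SI_C of silence_s."""
    if silence_s < MIN_SILENT_S:
        raise ValueError("a silent cell must last 0.5 s or longer")
    # Replace an existing leading SI_C instead of stacking a second one.
    has_silent_head = bool(program) and program[0][0] == "SI_C"
    body = program[1:] if has_silent_head else list(program)
    return [("SI_C", silence_s)] + body


track = [("A_C", 200.0), ("A_C", 95.5)]
print(set_leading_silence(track, 2.0))
```

Replacing rather than adding a leading silent cell keeps the result within the arrangement in which only the first cell of a program may be silent.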

[0294] Although PTS are written discontinuously
between adjacent PGs in FIGS. 37A to 37C and FIGS.
38A to 38C, the PTS may be continuous because the
audio data streams in both PGs are continuous.
[0295] The two types of cells will be described system- 
atically as follows. First, as shown in FIG. 3, the DVD 
audio zone 71 is composed of a simple audio manager 
(SAMG) file, an audio manager (AMG) file, an audio still 
video set (ASVS) file, and an audio title set (ATS) file. 
The audio title set (ATS) is composed of an audio title 
set information (ATSI) file, an audio-only title audio 
object set (AOTT_AOBS) file, and an audio title set 
information backup (ATS_BUP) file. 



[0296] As shown in FIG. 23, the audio title set informa- 
tion (ATSI) is composed of an audio title set information 
management table (ATS_MAT) file and an audio title set 
program chain information table (ATS_PGCIT) file. 

[0297] The ATS_PGCIT is composed of an audio title
set program chain information table information
(ATS_PGCITI) file, an audio title set program chain
information search pointer (ATS_PGCI_SRP) file, and
an audio title set program chain information
(ATS_PGCI) file.

[0298] As shown in FIG. 25, the ATS_PGCI is
composed of an audio title set program chain general
information (ATS_PGC_GI) file, an audio title set program
information table (ATS_PGIT) file, an audio title set cell
playback information table (ATS_C_PBIT) file, and an
audio title set audio still video playback information table
(ATS_ASV_PBIT) file.

[0299] A variable is set in each piece of audio title set
cell playback information (ATS_C_PBI) written in the
audio title set cell playback information table
(ATS_C_PBIT). The variable is the audio title set cell
type (ATC_C_TY) shown in FIG. 27. It specifies
which one of the following items the cell falls under:

[J-1] An audio cell (A_C) composed of only audio
data.

[J-2] An audio cell (A_C) composed of audio data
and real-time information.

[J-3] A silent cell (SI_C) composed of only silent
audio data with an amplitude level of zero.

[0300] Furthermore, "00h" is specified for the variable
(ATC_C_IXN) when the cell is SI_C. When the cell is
A_C, the index number (in the range of 1 to 99) of the
cell is specified.

[0301] Because the player knows the types of the
individual cells from these pieces of information, it can
recognize the presence or absence of a break in sound
beforehand.
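
How a player might use the cell type and index number can be sketched as below. The specification does not give the numeric encoding of the cell type, so the values in `CellType` are placeholders, and the function names `index_is_valid` and `starts_with_silence` are likewise hypothetical. Only the rule that a silent cell carries index 00h and an audio cell an index from 1 to 99 is taken from the text.

```python
from enum import Enum
from typing import Sequence


class CellType(Enum):
    # Placeholder codes: the text does not state the actual encoding.
    AUDIO_ONLY = 0       # [J-1] audio data only
    AUDIO_WITH_RTI = 1   # [J-2] audio data and real-time information
    SILENT = 2           # [J-3] silent audio data only


def index_is_valid(cell_type: CellType, index_number: int) -> bool:
    """Check the index number rule: 00h for SI_C, 1 to 99 for A_C."""
    if cell_type is CellType.SILENT:
        return index_number == 0x00
    return 1 <= index_number <= 99


def starts_with_silence(cell_types: Sequence[CellType]) -> bool:
    """True when the first cell of a track is a silent cell."""
    return bool(cell_types) and cell_types[0] is CellType.SILENT


track_cells = [CellType.SILENT, CellType.AUDIO_ONLY, CellType.AUDIO_WITH_RTI]
print(starts_with_silence(track_cells))
print(index_is_valid(CellType.SILENT, 0x00), index_is_valid(CellType.AUDIO_ONLY, 0))
```

Reading these fields from the management information before playback begins is what lets the player anticipate a silent period instead of discovering it in the stream.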

[0302] FIG. 39 shows a reproducing apparatus for
playing back the DVD audio disk. Because the
reproducing apparatus is a unit for reproducing only audio
data, it has no system for processing video data and
sub-picture data, unlike the apparatus of FIG. 30.
When a disk on which image data has been
recorded is played back, the reproducing apparatus
simply ignores the image data periods.
[0303] Specifically, even when the image data has
arrived, the system processor section 54 does not
transfer the data to the audio decoder section 60. When
the silent cell data has arrived, it transfers the data as
audio data to the audio decoder section 60. The remain-
ing sections are almost the same as those in FIG. 30.
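
The routing rule in paragraph [0303] can be sketched as a simple filter. The pack kind labels ("audio", "silent", "image") and the function `packs_for_audio_decoder` are hypothetical stand-ins for whatever the system processor section actually inspects, and the handling of real-time information packs is outside the scope of this sketch.

```python
from typing import Iterable, List, Tuple

Pack = Tuple[str, bytes]  # (pack kind, payload); kinds are hypothetical labels


def packs_for_audio_decoder(packs: Iterable[Pack]) -> List[bytes]:
    """Forward audio and silent-cell payloads; drop everything else."""
    forwarded = []
    for kind, payload in packs:
        if kind in ("audio", "silent"):
            # Silent cell data is treated as ordinary audio data.
            forwarded.append(payload)
        # Any other kind (for example "image") is ignored by this sketch.
    return forwarded


stream = [("audio", b"\x01\x02"), ("image", b"\xff"), ("silent", b"\x00\x00")]
print(packs_for_audio_decoder(stream))
```
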
[0304] Although in the above embodiment the image
data has been ignored completely, a terminal 54-1 for
separating and extracting only the image data may be
provided on the system processor section 54. With this
configuration, the user can make use of a purchased
disk with images by, for example, playing back the
audio disk at home and supplying the image data to the
decoder input terminal of a DVD player. An audio output
terminal may, of course, be added.
[0305] With this reproducing apparatus, when the
real-time information has been recorded on a disk, the
data can be demodulated at the system control section
50 or a separately provided demodulating section and
displayed on the panel display section 46. In this case,
it is desirable that the panel display section 46 should
have, for example, a liquid-crystal screen. Various keys,
including a ten-key pad, are used as the key input
section 4.

[0306] Either type of disk playback apparatus must 
have the function of identifying the cell type. 
[0307] FIG. 40 shows a reproducing apparatus capa-
ble of playing back a disk on which images have been 
recorded. When image data has been recorded on the 
disk, the reproducing apparatus reproduces the data 
and displays it on the monitor 6. When a disk of another 
type is played back, the reproducing apparatus oper- 
ates as the apparatus of FIG. 39 does. 
[0308] While the present invention has been explained 
using the recording medium and the disk playback 
apparatus, it may be applied to a case where the audio 
information defined as described above is transmitted 
via a transmitting unit and received by a receiving unit 
and thereafter reproduced. Furthermore, the invention 
may be applied to a case where a control signal for real- 
izing the function of receiving and processing the afore- 
mentioned audio information is transmitted to a 
receiving unit and thereafter the audio information 
defined as described above is read from a transmitting 
or recording medium and reproduced. 

Industrial Applicability 

[0309] As described above, the present invention
produces the following effect. The audio attributes can be
specified track by track. This causes a sound break
problem: when the audio attributes change, the audio
output is interrupted for the time needed to reset the
hardware environment of the player.

[0310] To solve this problem, audio cells and silent
cells have been defined and their arrangement has
been restricted. The introduction of such a concept
enables the content producer to actively manage and set
the sound break time. For example, when tracks with
sound breaks mingle with tracks with no sound break,
silent periods of time can be standardized in any track
by placing a silent cell at the head of each track with no
sound break. This prevents the mixture of tracks with
and without sound breaks from giving an unnatural
feeling to the user.



Claims 

1. An audio data structure characterized in that, in
audio contents which have cells for defining at least
an audio title playback unit and whose actual
playback sequence is determined by defining the
playback sequence of the cells, identification
information to specify the types of the cells according
to the difference in the contents of the data
included in said cells is included in cell information
to specify said cells.

2. The data structure according to claim 1,
characterized in that one type of the contents of the data in
said cells is for obtaining the length of a silent
period of time and the identification information
corresponding to the cell indicates a silent cell.

3. The data structure according to claim 1,
characterized in that said identification information defines

audio cells composed of ordinary audio data as
a first type, and

silent cells composed of only audio data with
an amplitude level of zero as a second type,
and

identifies the types of the cells.

4. An audio data structure characterized in that, in a
track that defines the playback sequence of one or
more cell groups,

an arrangement of audio cells composed of
ordinary audio data items is allowed as a first
arrangement, and

an arrangement of silent cells composed of
only audio data items with an amplitude level of
zero and audio cells composed of ordinary
audio data items is allowed as a second
arrangement.

5. An audio data structure characterized in that, in a
track that defines the playback sequence of one or
more cell groups,

the audio attributes of the cells constituting said
track and track connection information
indicating the relationship in playback time between
the present track and the preceding track are
provided between tracks.

6. The data structure according to claim 5,
characterized in that said audio attributes include at least
sampling frequency, the number of quantization
bits, channel assignment, and down-mix
coefficient.



7. The data structure according to claim 3,
characterized in that the presentation time for said audio cells
is set to one second or longer.

8. The data structure according to claim 3,
characterized in that the presentation time for said silent cells
is set to 0.5 second or longer.

9. A disk playback apparatus characterized by
comprising

an information recording medium which has
cells for defining at least an audio title playback
unit, audio contents whose actual playback
sequence is determined by defining the
playback sequence of the cells, and an audio data
structure where identification information to
identify the types of the cells according to the
difference in the contents of the data included
in said cells is included in cell information to
specify said cells, with at least silent data
existing in said cells and said identification
information including silent cell identification
information to identify said silent data cell, and

means for reading the recorded information on
said information recording medium and, when
recognizing said silent cell identification
information, supplying the audio contents
integrated in the identification information to an
audio decoder section.

10. The disk playback apparatus according to claim 9,
characterized in that said identification information
defines audio cells composed of ordinary data as
cells of a first type and said silent cells composed of
only audio data with an amplitude level of zero as
cells of a second type and discriminates between
these cells on the basis of cell time information, and

said playback apparatus includes means for
judging said cell type information and, when
judging that picture cells have been inputted,
ignoring the input and, when judging that silent
cells and audio cells have been inputted,
supplying the data in these cells to an audio
decoder for playback.

11. A transmission apparatus characterized by
transmitting such information as includes audio contents
which have cells for defining at least an audio title
playback unit and whose actual playback sequence
is determined by defining the playback sequence of
the cells, and causes identification information to
identify the types of the cells according to the
difference in the contents of the data included in said
cells to be included in cell information to specify
said cells, one type of the contents of the data in
said cells being for obtaining the length of the silent
period of time and the identification information
corresponding to the cell indicating a silent cell.

12. The transmitting apparatus according to claim 11, 
characterized in that the period in which said silent 
cell is transmitted is set between programs. 

13. A receiving apparatus characterized by comprising 
means which receives such information as includes 
audio contents which have cells for defining at least 
an audio title playback unit and whose actual play- 
back sequence is determined by defining the play- 
back sequence of the cells, and causes 
identification information to identify the types of the 
cells according to the difference in the contents of 
the data included in said cells to be included in cell 
information to specify said cells, one type of the 
contents of the data in said cells being for obtaining 
the length of the silent period of time and the identi- 
fication information corresponding to the cell indi- 
cating a silent cell and which, when recognizing the 
identification information indicating said silent cell, 
supplies the cell integrated in the identification infor- 
mation to an audio decoder section for playback. 

14. A recording medium on which audio contents are to 
be recorded, the audio contents having cells for 
defining at least an audio title playback unit and 
making actual playback sequence determined by 
defining the playback sequence of the cells, said 
recording medium characterized by recording iden- 
tification information to identify the types of the cells 
according to the difference in the contents of the 
data included in said cells in such a manner that the 
identification information is included in cell informa- 
tion to specify said cells. 

15. The recording medium according to claim 14, char- 
acterized in that one type of the contents of the data 
in said cells is for obtaining the length of the silent 
period of time and the identification information cor- 
responding to the cell indicates a silent cell. 

16. The recording medium according to claim 14, char- 
acterized in that said identification information 
defines audio cells composed of ordinary data as 
cells of a first type, and 

silent cells composed of only audio data with 
an amplitude level of zero as cells of a second 
type and identifies the types of the cells. 

17. A recording medium characterized in that, in a track
defining the playback sequence of one or more cell
groups,

an arrangement of audio cells composed of
ordinary audio data items is allowed as a first
arrangement, and

an arrangement of silent cells composed of
only audio data items with an amplitude level of
zero and audio cells composed of ordinary
audio data items is allowed as a second
arrangement for recording.

18. A recording medium characterized in that, in a track
defining the playback sequence of one or more cell
groups,

the audio attributes of the cells constituting said
track and track connecting information
indicating the relationship in playback time between
the present track and the preceding track are
recorded between tracks.

19. The recording medium according to claim 18,
characterized in that said audio attributes include at
least sampling frequency, the number of
quantization bits, channel assignment, and down-mix
coefficient.

20. The recording medium according to claim 16,
characterized in that the presentation time of said audio
cells is set at one second or longer.

21. The recording medium according to claim 16,
characterized in that the presentation time of said silent
cells is set at 0.5 second or longer.